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A method for isolating a gene, comprising: (i) isolating a cell possessing a post-translatJonal characteristic of interest, said post- 
translational characteristic being the presence of a membrane-bound oligosaccharide or polysaccharide of interest on the surface of said 
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material of said isolated cell; (iii) transforming host cells with said genetic library; and (iv) screening said transformed host cells for a host 
cell containing said post-translational characteristic thereby obtaining a cell containing said gene, is disclosed. The method can be used to 
obtain genes encoding glycosyltransferases. 
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TITLE OF THE TNVENTTQM 

"CHIMERIC FUCOSYLTRANSFERASE POLYPEPTIDES AND NUCLEIC 
ACID SEQUENCES ENCODING THEM". 



BACKGROUND OF THE TWVF^ J ffW 

This work was supported by National Institutes of Health 
Grants GM47455 and Kll DK02189, and the U.S. Government may 
have rights in this 



Field of the Invent i PT1 g 

The present invention relates to methods and products for 
the synthesis of oligosaccharide or polysaccharide structures, 
on glycoproteins, glycolipids, or as free molecules. The 
present invention also relates to novel chimeric 
f ucosy ltransf erases • 



Discuss ion of the Background ; 

Carbohydrates are an important class of biological 
compounds which are remarkable for their structural diversity. 
This diversity is not random but rather consists of specific 
sets of oligosaccharide structures that exhibit precise 
tissue-specific and developmental expression patterns. In 
cells carbohydrates function as structural components where 
they regulate viscosity, store energy, or are key components 
of cell surfaces. Numerous site specific intercellular 
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interactions involve cell surface carbohydrates. For example, 
union of sperm and egg as well as the implantation of 
fertilized egg are both mediated by cell surface 
carbohydrates. Likewise, a number of proteins that function 
as cell adhesion molecules, including GMP-140, Endothelial 
Leukocyte Adhesion Molecule-1 (ELAM-1) , and lymphocyte adhesion 
molecules like Mel-14, exhibit structural features that mimic 
lectins, and are now known to bind specific cell surface 
carbohydrate structures (Feizi, Trends Biochem. Sci. (1991) 
16:84-86). Glycosylated proteins as tumor-associated antigens 
are now being used to identify the presence of numerous 
carcinomas. Even isolated oligosaccharides have been found to 
exhibit biological activity on their own. 

Specific galactose oligosaccharides are known to inhibit 
the agglutination of uropathogenic caliform bacteria with red 
blood cells (U.S. Patent No. 4,521,592). Other 
oligosaccharides have been shown to possess potent 
antithrombic activity by increasing the levels of plasminogen 
activator (U.S. Patent No. 4,801,583). This same biological 
activity has been used, by binding oligosaccharides, in 
conjunction with an amino glycoprotein, in medical instruments 
to provide medical surfaces which have anticoagulation effects 
(U.S. Patent No. 4,810,784). Still other oligosaccharides 
have found utility as gram positive antibiotics and 
disinfectants (U.S. Patent Nos. 4,851,338 and 4,665,060). 
Further, oligosaccharides have been used as bacteria receptor 
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sites in the diagnosis and identification of specific bacteria 
(U.S. Patent Nos. 4 f 657,849 and 4,762,824). 

It is also well recognized that oligosaccharides have an 
influence on the protein or lipid to which they are conjugated 
fRademacher et al. Ann. Rev. Bioehem, . (1988) 57:785). 
Specific oligosaccharides have been shown to influence 
proteins' stability, rate of in vivo clearance from blood 
stream, rate of proteolysis, thermal stability and solubility. 
Changes in the oligosaccharide portion of cell surface 
carbohydrates have been noted in cells which have become 
cancerous. Other oligosaccharide changes have been detected 
during cell differentiation f Toone et al . Tetrahedron Report 
(1989) 151121:5365-5422). As such, the significance of 
oligosaccharides to biological function cannot be understated. 

The fundamental role of these materials in molecular 
biology has made them the object of considerable research, in 
particular, considerable efforts have been made in organic 
synthesis to synthesize these materials. Although synthetic 
approaches to making carbohydrates are quite developed, this 
technique suffers notable difficulties which relate to the 
selective protection and deprotection steps required in the 
available synthetic pathways. These difficulties, combined 
with difficulties associated with isolating and purifying 
carbohydrates, and determining their structures, has made it 
essentially impossible for synthetic organic chemistry to 
economically produce valuable carbohydrates. 
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Enzyme-mediated catalytic synthesis would offer dramatic 
advantages over the classical synthetic organic pathways, 
producing very high yields of carbohydrates (e.g., 
oligosaccharides and/or polysaccharides) economically, under 
mild conditions in aqueous solution, and without generating 
notable amounts of unde sired side products. Such enzymes, 
which include glycosyltransf erases, are however difficult to 
isolate, especially from eukaryotic, e.g., mammalian sources, 
because these proteins are only found in low concentrations, 
and are membrane -bound. 

As of 1987, standard molecular cloning approaches which 
require amino acid sequence information or anti- 
gly cosy 1 transferase antibodies, had been successfully used to 
isolate just two eukaryotic, e.g., mammalian 
glycosyltransf erase cDKAs, corresponding to 
0(1,4) galactosyltransf erase (in 1986) and 
a (2 , 6) sialyltransferase (in 1987). In light of the above- 
noted considerable value of carbohydrates, there is 
accordingly a strongly felt need for an improved method for 
isolation of additional glycosyltransferase genes and cONAs 
and for their use in carbohydrate synthesis. 



WO 97/09421 





YUS96/13816 



-5- 



SUMMARY OF THE II 



Accordingly, it is an object of this invention to provide 
a method for readily isolating eukaryotic, e.g., mammalian 



glycosyltransf erase genes and cDNAs . 

It is another object of this invention to provide a 
method to modify these isolated genes and cDNAs to obtain 
correspondingly modified glycosyltransf erases . 

It is another object of this invention to provide these 
unmodified and modified isolated genes and cDNAs, and to use 
them, for example, in modifying cell surface oligosaccharide 
structure via gene transfer approaches or via in vitro 
glycosylation reactions. 

. The inventor has now discovered a gene transfer approach 
which satisfies all of the above-noted objects of this 
invention, and other objects which will be seen from the 
description of the invention given hereinbelow. The present 
methodology takes advantage of existing information about 
substrate and acceptor properties of glycosyltransferases and 
makes use of the numerous antibody and lectin reagents that 
are specific to the cell surface-expressed oligosaccharide 
products of these enzymes. The inventors have also discovered 
that useful chimeric fucosyltransf erases can be prepared from 
enzymes so obtained. 



* 



* 
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BRIEF DESCRIPT ION OF THE FIGURES 

Figures 1, 2, 3, 4, 5, 6, and 7 provide six DNA sequences 
provided by the invention, encloding glycosyltransf erases. 

Figure 1 shows the DNA sequence encoding a protein 
capable of functioning as a GDP-Fuc: [0-D-Gal(l,4/l,3)-D- 
GlcNac (/Glc) -a- (1,3/1, 4) -fucosyl transferase (Lewis 
fucosyltransf erase, Fuc-TIII) (SEQ ID NO:l) and the amino acid 
sequence of the encoded protein (Fuc-TIII) (SEQ ID NO: 2). 
Figure 2 shows the DNA sequence encoding a mouse UDP-Gal:j8-D- 
Gal-(i,4)-D-GlcNac a (1,3) -galactosyl transferase (SEQ ID NO: 3) 
and the encoded protein (SEQ ID NO: 4). Figure 3 shows the DNA 
sequence encoding a human GDP-Fuc : 0-D-galactoside a (1,2)- 
fucosyltransf erase (SEQ ID NO: 5) and the amino acid sequence 
of the encoded protein (SEQ ID NO: 6) • 

Figures 4 and 5 provide DNA sequences (SEQ ID NO: 7 and 
SEQ ID NO: 9, respectively) encoding a GDP-Fuc: [#-D-Gal(l,4) ]- 
D-GlcNacat ( 1 , 3 ) -fucosyltransf erase (Fuc-TIV) and the encoded 
protein (Fuc-TIV) (SEQ ID N0:8). Figure 5 also shows the 
amino acid sequence of the Lewis fucosyltransferase (Fuc- 
TIII) (SEQ ID NO: 2) . 

Figure 6 provides a DNA sequence (SEQ ID NO: 10) encoding 
a GDP-Fuc: [0-D-Gal (1, 4) ]-D-GlcNAca( 1,3) -fucosyltransferase 
(Fuc-TV) (numbered upper nucleotide strand) and its 
corresponding protein sequence (Fuc TV) (SEQ ID NO: 11), 
together with the DNA sequence of the Lewis blood group 
fucosyltransferase (Fuc-TIII) (unnumbered lower sequence) . 



WO 97/09421 




VUS96/13816 



-7- 



Amino acid differences with the Lewis fucosyltransf erase are 
indicated by inclusion of Lewis amino acids below the Lewis 
DNA sequence (SEQ ID NO: 12). The transmembrane domain of the 
fucosyltransf erase is underlined. 

Figure 7 provides a DNA sequence of the coding portion of 
the genomic DNA insert in pCDNAl-a (1,3) Fuc-TVI (SEQ ID NO: 13), 
and parts of the 5' and 3' regions of that gene* DNA sequence 
comparison between the GDP-Puc: [/J-D-Gal(l, 4) ] -D-GlcNaca ( 1 , 3 ) - 
fucosyltransferase (Fuc-TVI) encoded by the genomic DNA 
fragment in pCDNAl-cr (1,3) Fuc-TVI (labeled cr(l,3)FT DNA) and 
the Lewis blood group fucosyltransferase (Fuc-TIII) (labeled 



Lewis DNA) (SEQ ID NO: 12) is also shown in Fig. 7. Positions 
of DNA sequence identity are denoted by a vertical line ( | ) 
between identical nucleotides at similar positions. Positions 
where the sequences are out of register are denoted by ( . ) . 
The derived protein sequence of Fuc-TVI , in single letter 
code, and labeled a(l,3) FT AA, is indicated above its DNA 
sequence (SEQ ID NO: 14). 

Figure 8 presents flow cytometry profile histograms of 
COS-l cells transfected with different 

a (1,3) fucosyltransferase gene expression vectors, or with 
control vectors, and subjected to analysis with monoclonal 
antibodies directed against cell surface oligosaccharide 
determinants. COS-l cells were transfected either with 



plasmid pCDM7 (pCDM7) , with plasmid pCDNAI (pCDNAI) , with pCDM7 
containing DNA sequences encoding the 
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a (1,3/1,4) fucosyltransf erase depicted in Figure 1 (the Levis 
fucosyltransf erase, also known as Fuc-TIII, SEQ ID NO:2), or 
with pCDNAI containing DNA sequences encoding the 
a (1,3) fucosyltransf erase depicted in Figure 4 (Fuc-TIV, SEQ ID 
NO: 8), the a (l, 3) fucosyltransf erase depicted in Figure 6 (Fuc- 
TV, SEQ ID NO: 11), or the a (1,3) fucosyltransf erase depicted 



in Figure 7 (Fuc-TV, SEQ ID NO: 14). Three days after 
transfection, the cells were harvested, stained with 
monoclonal antibodies (shown at the top left within the 
figure) directed against the H (anti-H) , Lewis x (anti-Lex) , 
sialyl Lewis x (anti-sLex) , Lewis a (anti-Lea) , or sialyl 
Lewis a (sLea) oligosaccharide determinants, and then stained 
with a fluorescein-conjugated second antibody. The cells were 



then subjected to analysis by flow cytometry. The histograms 
represent the mean fluorescent intensities of the antigen- 
positive cells in each trans fectant population (approximately 
25% to 30% of the cells are transfected and express the 
positive cell surface markers) . Methods for these analyses 
have been described in detail in Lowe et al r J, Biol. Chem. . 
(1991), 266 :17467-17477. Weston et al . J, Biol. Chem. . (1992), 
2£2: 4152-4160, Lowe et al . Cell , (1990), 63:475-484, and Ernst 
et al . J, Biol. Chem* . (1989), 264:3436-3447. 

Figures 9a and b show a comparison of protein and 
DNA sequences of Fuc-TIII and Fuc-TVI. Fig* 9a, Fuc-TIII and 



Fuc-TVl are schematically represented by open rectangles. The 
transmembrane domain of each enzyme is denoted by a striped 



* 
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) . Vertical lines within the rectangles denote the 



positions of amino acid residues that differ between the two 
enzymes. These differences are concentrated in the 
"hypervariable" segment between the Mill and Bgl II 
restriction sites shown under the rectangles. The carboxyl- 
terminal regions of these enzymes, distal to the Bgl II site, 
differ at 7 positions. The relative positions of these 
differences, and their nature (six non-identical amino acids 
and one amino acid/ insertion; identified in single letter 
amino acid code) are also indicated in the figure. Sites of 
potential asparagine- linked glycosylation are indicated by V • 
The disaccharide portions of the potential oligosaccharide 
acceptors are also shown in the figure. "LacNAc" denotes the 
type II precursor tf-acetyllactosamine. LNB-I denotes the type 
I precursor lacto-N-biose I. Arrows near the disaccharide 
moieties point to the hydroxy 1 group that will be modified by 
fucosylation. R denotes the underlying glycocon jugate that 
displays the disaccharide. R' represents either a hydroxy 1 
group in the neutral precursors for Lewis x or Lewis a 
products, or a (2 , 3) -linked sialic acid in the sialylated 
precursor for the sialyl Lewis x or sialyl Lewis a products. 
Fuc-TIII (top) can utilize GDP-fucose to form products from 
both type II and type I precursors, whereas Fuc-TVI (bottom; 
shaded to match Figs. 10, 11a and b, and 12a and b) can 
utilize GDP-fucose to form products only from the type II 
precursors. 
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Fig. 9b, Subdomains within the "hypervariable" segments 
of Fuc-TIII and Fuc-TVI. The DNA and derived protein 
sequences of the two enzymes are aligned. The upper DNA and 
protein sequences correspond to Fuc-TIII. The lover DNA and 
protein sequences correspond to Fuc-TVI. Dotted lines (....) 
indicate positions in one enzyme that do not have a 
counterpart in the other. Residues are numbered beginning 
with the initiator methionine codon of Fuc-TIII (residue 
number 1) . Non- identical amino acid residues are shaded. 
Underlined DNA sequences correspond to the synthetic 
oligonucleotides used to create or destroy restriction sites 
(positions and identity indicated above, for Fuc-TIII, or 
below, for Fuc-TVI) that define the boundaries of each 
subdomain. The nature of the DNA sequence changes needed to 
create these sites are denoted above (Fuc-TIII) or below (Fuc- 
TVI) the wild type DNA sequence. As noted in the text, these 
changes maintain the wild type protein sequence of each 
enzyme . 

Figure 10 shows flow cytometry histograms of COS -7 cells 
trans feet ed with recombinant Fuc-T chimeras generated via 
carboxyl -terminal domain exchanges. Fuc-TIII (open 
rectangles) and Fuc-TVI (shaded rectangles) are illustrated as 
in Figure 9a. Recombinant chimeras derived by restriction 
fragment exchange procedures (see Experimental Procedures) are 



composed of segments of Fuc-TVI (shaded areas) and Fuc-TIII 
(open areas) . Transmembrane segments are denoted by the 
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darkly shaded regions near the NH 2 -termini of each 
schematically represented enzyme. A symbol ( V ) denotes the 
positions of potential asparagine-linked glycosylation sites. 
Relative positions of the five subdomains in the 
"hypervariable" region are numbered. Positions of the Bglll 
and BstXl sites used to create the chimeras in this figure are 
indicated below each schematically represented enzyme. COS -7 
cells transfected with vectors encoding each enzyme, or a 
negative control vector (pcDNAI) were stained with the 
monoclonal antibodies indicated in the legend in the figure, 
and analyzed by flow cytometry. Histograms display the 
fraction of antigen-positive cells (normalized for 
transfection efficiencies, as described in the Examples, and 
represent the average of either two or three experiments. The 
mean fluorescence intensity (M.F.I. ) of the antigen-positive 
cells are indicated by the numbers at the left of each 
histogram, above the arrow (t) at the bottom of each panel. 
Numbers in parenthesis in this column denote the mean 
fluorescence intensity of the antigen-negative cells whenever 
the experiment yielded no antigen-positive cells. 

Figures 11a and b show flow cytometry histograms of COS -7 
cells transfected with recombinant Fuc-T chimeras generated 
via single subdomain exchanges. Symbols, legends, and 
nomenclature are identical to that used in Figure 10. The 
five subdomains in the "hypervariable" region of each enzyme 
are numbered. COS-7 cells transfected with vectors encoding 
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each enzyme, or a negative control vector (pcDNAI) were 
stained with the monoclonal antibodies indicated in the legend 
in each panel, and were analyzed by flow cytometry. 
Histograms display the fraction of antigen-positive 
trans fee ted COS-7 cells (normalized for transfection 
efficiencies, as described in Experimental Procedures), and 
represent the average of either two or three experiments. 
Mean fluorescence intensities (M.F.I.) of antigen-positive 
cells are indicated by the numbers at the left of each 
histogram , above the arrow (t) at the bottom of each panel. 
Numbers in parenthesis in this column denote the mean 
fluorescence intensity of the antigen-negative cells whenever 
the experiment yielded no antigen-positive cells. Figure 11a, 
single subdomains from Fuc-TVI (shaded areas) installed into 
Fuc-TIII (open rectangles) . Figure lib, single subdomains from 
Fuc-TIII (open rectangles) installed into Fuc-TIII (shaded 
rectangles) . 

Figures 12a and b show flow cytometry histograms of COS-7 
cells transfected with recombinant Fuc-T chimeras generated 
via multiple subdomain exchanges. Symbols, legends, and 
nomenclature are identical to that used in Figures 10 and 11a 
and b. The five subdomains in the " hyp ervari able" region of 
each enzyme are numbered. COS-7 cells transfected with 
vectors encoding each enzyme, or a negative control vector 
(pcDNAI) were stained with the monoclonal antibodies indicated 
in the legend in each panel, and were analyzed by flow 
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cytometry. Histograms display the fraction of antigen- 
positive transfected COS -7 cells (normalized for transfection 
efficiencies, as described in the Examples) , and represent the 
average of either two or three experiments. Mean fluorescence 
intensities (H.F.I.) of antigen-positive cells are indicated 
by the numbers at the left of each histogram , above the arrow 
(t) at the bottom of each panel.. Numbers in parenthesis in 
this column denote the mean fluorescence intensity of the 
antigen-negative cells whenever the experiment yielded no 
antigen-positive cells. Figure 12a, top set of histograms, 
multiple subdomains from Fuc-TVI (shaded areas) installed into 
Fuc-TIII (open rectangles) (chimeras Fuc-TC15, Fuc-TC16, and 
Fuc-TC19). Figure 12a, bottom histogram, subdomains 4 and 5 
from Fuc-TV (lightly shaded areas) replace subdomains 4 and 5 
in Fuc-TIII (chimera Fuc-TC21) . Figure 12b r single subdomains 
from Fuc-TIII (open rectangles) installed into Fuc-TIII 
(shaded rectangles) (chimeras Fuc-TC17, Fuc-TC18, and 
Fuc-TC20) . 



Figure 13 shows protein sequence alignments of Fuc-TIII, 
Fuc-TV and Fuc-TVI across subdomains 4 and 5. The protein 
sequences of subdomains 4 and 5 of Fuc-TIII, Fuc-TV and 
Fuc-TVI are shown in the single letter code. Numbers at the 
left indicate the residue number of the leftmost amino acid of 
each peptide sequence. Dashed lines represent positions in 
Fuc-TV and FucTVI that are identical to corresponding residues 
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in Fuc-TIII. Non- identical residues are explicitly indicated 
by the single letter amino acid designation. Residues present 
in Fuc-TVI , but not found in either Fuc-TV or in Fuc-TIII 
("Unique" to Fuc-TVI) are displayed below, amongst residues 
(indicated by *) that are common to at least two of the three 
enzymes. 

DETAILED DESCRI PTION OF THE PREFERRED EMBODIMENTS 

Generally, the present invention provides a method for 
isolating a gene and/ or a cDNA from a cell, by using a post- 
translational characteristic of the cell. The cell from which 
this gene and/or cDNA may be isolated may be either a cell 
from a unicellular or a multicellular organism. 

In the context of the present invention, a post- 
translational characteristic of a certain cell is defined by 
the ability of that cell to modify a protein or a lipid by an 
enzymatic process that covalently attaches to this protein or 
lipid one or more monosaccharides, or an enzymatic process 
that specifically removes such substituents from a protein or 
lipid molecule. 

In one embodiment, the method comprises the following 

four basic steps: 

(i) identifying for use as a genetic donor, a eukaryotic 
(e.g. mammalian) cell possessing a post-translational 
characteristic of interest i.e. a particular membrane-bound 
oligosaccharide or polysaccharide (i.e. a glycoprotein or 
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glyco lipid) , soluble oligosaccharide or polysaccharide, or a 
particular enzymatic activity f vide infra ) ; 

(ii) creating a genetic library of either cDNA or 
genomic DNA from the genetic material of the donor eukaryotic 
(e.g. mammalian) cell ; 

(iii) identifying a specific eukaryotic host suitable as 
a recipient for gene transfer, and transforming this 
eukaryotic, (e.g. mammalian), host cells with this genetic 
library; and 

(iv) screening the transformed host cells for host cells 
possessing the post-translational characteristic of interest. 

The host cell which now possesses this post-translational 
characteristic contains genetic information related to the 
post-translational characteristic of interest. Using the 
techniques set forth below this genetic information (gene) can 
then be retrieved from the transformed host cell and used by 
standard approaches i.e., Axel et al (U.S. Patent No. 
4,634,665) or Gilbert et al (U.S. Patent No. 4,411,994) to 
produce large quantities of the gene product, i.e., the 
glycosyltransf erase, responsible for the post-translational 
characteristic • 

In step (i) above the donor eukaryotic (e.g. mammalian) 
cell is chosen on the basis of detecting a specific enzymatic 
activity in an extract of the cell or detecting a membrane- 
bound or soluble oligosaccharide or polysaccharide of the 
cell. 
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Thus in one embodiment the enzymatic activity which is 
detected in the cell extract can be enzymatic activity 
attributable to an animal enzyme which post-translationally 
modifies proteins, lipids, or oligosaccharides by 
glycosylation or glycosyl modification. This enzymatic 
activity may be detected by using a substrate specific for one 
of these enzymes. Such substrates are known. 

In another embodiment, in step (i) above a cell is chosen 
on the basis of detection of a specific cellular membrane- 
bound oligosaccharide and/ or polysaccharide. 

In another embodiment , the cell in step (i) is chosen on 
the basis of detecting the presence of a soluble 
oligosacharride or polysaccharide in an extract of the cell, 
or released by the cell in soluble form. 

The present invention provides a novel gene transfer 
approach designated to isolate genes from an organism without 
requiring that amino acid sequence information be obtained 
about the gene product or that an antibody specific to the 
gene product be available. For example, if a gene encoding a 
particular enzyme is sought, a series of cultured cell lines 
or tissues are screened by known and standard methods to 
identify one or more cell lines or tissues containing an 
expressible gene of interest by detecting in an extract of the 
cell or tissue specific enzymatic activity (corresponding to 
the enzyme of interest and thus the cell contains and/ or 
expresses a gene for the enzyme sought) . If an 
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oligo saccharide or polysaccharide membrane component of the 
cell is of interest, a cell line or tissue possessing such a 
membrane characteristic is isolated. If a soluble 
oligosaccharide or polysaccharide is of interest, a cell line 
or tissue possessing a soluble oligosaccharide or 
polysaccharide detectable in an extract thereof is isolated. 

once such a cell line or tissue has been identified, a 
genetic library based on this isolated cell is created. This 
genetic library may be either cDNA or genomic DNA. In a 
preferred embodiment, if the isolated cell is known to be 
susceptible to enhancement of its post-translational 
characteristic of interest by being contacted with a 
particular reagent, this reagent is used to obtain an 
enhancement of the mRNA signal in this cell, and/ or the gene 
itself, which consequently produces amplified in mRNA copies 
and thus ultimately cDNA copies, corresponding to that 
particular gene, or amplified gene segments. Both the cDNA 
and the genomic DNA genetic libraries may otherwise be 
obtained using known techniques. Once the genetic library is 
obtained, it is used to transform host cells using known 
techniques (e.g., by calcium phosphate precipitation, 
liposomal transfection, DEAE dextran transf ection, 
microin j ection , etc . ) . 

Host cells useful in the present invention are preferably 
susceptible to lectin or antibody detection of the desired 
post-translational characteristic; that is, susceptible to 
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lectin or antibody detection of membrane-bound 
oligosaccharide, polysaccharide, or of glycoprotein or 
glycolipid produced in the transformed host cell. However, 
screening of host cells not susceptible to such lectin or 
antibody detection may be achieved through screening for 
enzyme activity in accordance with the invention. 

(A) Host (e.g. mammalian) cells should be eukaryotic 
cells to allow and preserve the catalytic function of the 
enzyme (the glycosyltransf erase) . (B) The host cell should 
not express significant levels of glycosyltransf erase activity 
analogous to the desired one, or the cognate product. With 
glycosyltransf erase-related genes, successful transformation 
of a host cell can be determined by detecting corresponding 
enzymatic activity in an extract of the cell. (C) In another 
characteristic, the host cell should be capable of 
synthesizing the appropriate sugar nucleotide substrate and 
transport it into Golgi (where glycosyltransf erase catalytic 
domains exist and function) . Virtually all wild type animal 
cells possess this function. (D) The host cell should possess 
the ability to synthesize the appropriate acceptor substrate 
(oligosaccharide, lipid, or protein) that the desired 
glycosyltransf erase requires, and the cell must display the 
structure on the cell surface or release it into the cellular 
environment /media. (£) The host cell should allow or provide 
for expression of transf ected sequences that encode or 
otherwise determine expression of the desired 
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glycosyltransf erase. This is inherent in eukaryotic (e.g. 
mammal) to eukaryotic (e.g. mammal) genomic DNA transfer, or 
in vector systems chosen for cDNA expression system, using 
known technology. (F) The host cell should allow for rescue 
of the trans fee ted sequences that encode or otherwise 
determine expression of the relevant glycosyl transferase. 

Wild-type eukaryotic (e.g. mammalian) cells possess these 
characteristics generally. Any particular wild-type cell of 
interest which does not possess criteria (B) or (D) set forth 
above, may be mutated, using standard techniques, to obtain a 
mutant cell possessing either of these criteria. If an enzyme 
assay-based selection method is used, then the criteria (C) 
and (D) set forth above are not necessary. 

Once the host cells have been transformed, the population 
is screened for host cells containing the genetic material of 
interest. This is achieved by determining whether the host 
cell possesses the post-translational characteristic of 
interest, i.e., by detecting enzymatic activity in an extract 
of the transformed host cell, detecting membrane-bound 
oligosaccharide or polysaccharide on the cell, or detecting 
soluble oligosaccharide or polysaccharide in an extract of or 
secreted by the cell. The host cells which test positive are 
isolated, and the gene of interest can be retrieved from these 



If the host cells are transformed by genomic DNA 
transfection, the gene rescue may be carried out as follows: 
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(a) molecular cloning by hybridization, via tagging of 
trans feet ed genomic sequences by species-specific repetitive 
sequences ; or 



(b) tagging of transf ected genomic sequences by in vitro 
or in vivo ligation to marker sequences. 

If cDNA is used to transform the host cells, the 



gene/cDNA rescue may be carried out as follows: 

(a) episomal rescue via Hirt procedure, or 

(b) integrated copy rescue via plasmid tag. 

Further detail with regard to gene rescue is provided in 
the accompanying examples. 

Example of appropriate donor and host cells include the 
following: 



(I) Human Blood Group H a (1,2) fucosyl-transf erase - 
f Ernst et al . J. Biol. Chem. 264:3436-3447, 1989; Raian et al . 



Natl. Acad. Sci. USA . 87:6674-6678, 1990). 

A. ) L cell host-mouse species. 

B. ) Does not express a (1,2) fucosyltransferase 
activity. Does not express Fuca(l,2)Gal structures on 
cell surface. 

C. ) Does synthesize GDP-fucose, the sugar 
nucleotide substrate of a (1,2) fucosyltransferase. 

D. ) Does synthesize Gal/S (1, 4)GlcNAc-R molecules 
that are acceptor substrates for the enzyme, and 
expresses them at cell surface. 



J. Biol. Chem. 264:11158-11167, 1989; Larsen et al . Proc. 
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£.) Mouse cells are known to be able to express 
human genes. 

F.) Mouse cells do not contain DNA sequences 
similar to human Alu repetitive DNA sequences. These Alu 
sequences (species-specific repetitive sequences) were 
used to identify and ultimately rescue the human gene 
from the mouse trans feet ant cell line. 

(II) Mouse a(l, 3 ) galactosyltransf erase - Larsen et al . 
Proc. Natl. Acad. Sci. USA , 86: 8227-8231, 1989. 

A. ) Kidney cell line expressing SV40 virus large T 
antigen - COS-1 cell line, monkey species. 

B. ) Does not express a (1,3) galactosyltransf erase 
activity. Does not express Gala (1,3) Gal structures on 
cell surface. 

C) Does synthesize UDP-galactose, substrate of 
a (1,3) galactosyltransf erase. 

D.) Does synthesize Gal/3 ( 1 , 4 ) GlcNAc-R molecules 
that are acceptor substrates for the enzyme, and 
expresses them at cell surface. 

£.) cDNA/COS-1 cell expression system for cDNA 
libraries - standard technology. 

F.) cDNA/COS-1 cell expression system for cDNA 
libraries - standard technology. 

(III) Human Lewis Blood Group a (1,3/1, 4) fucosyl- 
transf erase - J.F. Kukowska-Latallo et al . Genes and 
Development , vol. 4, (1990), pp. 1288-1303. 
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A. ) Kidney cell line expressing SV40 large T 
antigen COS-1 cell line, monkey. 

B. ) Does not express significant levels of 
a(l,3)fucosyltransferase activity. Does not express 
cell surface Gal0(l,4) [Fuca (1, 3) ]GlcNAc-R structure. 

C. ) Does synthesize GDF-fucose, substrate of 
a ( 1 , 3 ) f ucosy ltransf erase . 

D. ) Does synthesize GaljS (1,4) GlcNAc-R molecules 
that are acceptor substrates for the enzyme, and 
expresses them at cell surface. 

E. ) CDM 7/C0S-1 cell expression system for cDNA 
libraries - standard technology. 

F. ) CDM 7/COS-l cell expression system for cDNA 
libraries - standard technology. 

In the latter stages of selection for this gene, criteria 
C and D were not necessary because pools of cDNA clones were 
screened by transfecting them into C0S-1 cells, and then 
directly assaying extracts prepared from the transfected cells 
for a (1, 3) f ucosy ltransf erase activity. 

In one of its embodiments, the present invention provides 
a method for isolating a gene encoding a glycosy ltransf erase, 
and the gene thus isolated. This glycosy ltransf erase may be a 
fucosyltransf erase, a sialy ltransf erase, a N- 
acetylglucosaminy ltransf erase, a galactosy ltransf erase, a N- 
acetylgalactosaminy ltransf erase, a mannosy ltransf erase, a 
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sulf otransf erase , a glucosyltransf erase, an acetylase, or 
another gly cosy ltransf erase . 

Individual glycosy ltransf erases are known to be 
particularly related to different types of sugars transferred 
by the enzyme ( Bever et al . "Glycosy ltransf erases and Their 
Use in Assessing Oligosaccharide Structure and Structure- 
Function Relationships" Adv. Enzvmolocrv (1982) 52: 23-175 - 
hereby incorporated by reference) • Thus a particular kind of 
sugar linkage found on an oligosaccharide, glycoprotein, or 
glycolipid in or on a cell is associated with a particular 
glycosy ltransf erase. Methods are known for identifying such 
linkages (see Bever et al . supra ) , and can be used in 
accordance with the present invention to isolate the gene 
encoding the corresponding glycosy ltransf erase. 

Sialy ltransf erases, one of the glycosy ltransf erase 
provided by the present invention, are associated with the 
following sialic acid linkages: (1) Siaa2-6Gal; (2) 
Siaa2-3Gal; (3) Siaa2-6GalNac; (4) Siaa2-6GlcNAc; (5) 
Siaa2-8Sia; (6) Siaa2-4Gal; and (7) Siaa2-4GlcNAc. 

Fucosy ltransf erases, another type of glycosy ltransf erases 
provided by the present invention, are associated with the 
following linkages: (1) Fuca ( 1-2 )Gal/3-; (2) 
GaljS ( 1-3 ) [ Fuco ( 1-4 ) ] GlcNAcjS- ; ( 3 ) 

Gal0(l-4) [Fuca(l-3) ]GlcNAc0-; (4) Gal/3 (1-4) [Fuca(l-3) ]Glc; (5) 
-GlcNAcjS ( 1-4 ) [ Fuca ( 1-6 ) ] GlcNAc01-Asn ; ( 6 ) 

-GlcNAc/5 ( 1-4 ) [ Fuca ( 1-3 ) GlcNAc01-Asn ; ( 7 ) Fuca ( 1-6 ) Sal/S- ; ( 8 ) 



WO 97/09421 




7US96/13816 



24- 



Fuca(l-3)Gal0-; (a) Glc01^3Fucal-*o-Thr and Fucal-*)-Thr/Ser; 
(10) Fucal-»Cer amide; and (11) Fucocl-»3Fuc. 

N-Acetylglucosaminyltransf erases, also provided by the 
invention, are associated with the following linkages: (1) 
GlcNAc/3l-*4GlcNAc; (2) GlcNAc01-Asn; (3) GlcNAc£l->2Man; (4) 
GlcNAc01->4Man; (5) GlcNAc01-*6Man; (6) GlcNAcjSl-3Man; (7) 
GlcNAcal->3Man; (8) GlcNAc/3l-3Gal ; (9) GlcNAc01^4Gal; (10) 
GlcNAc/5l-6Gal; (11) GlcNAcal-4Gal; (12) GlcNAcal-4GlcNAc; (13) 
GlcNAc01->6GalNAc; (14) GlcNAc01->3GalNAc; (15) GlcNAcj9l-»4GlcUA; 

(16) GlcNAcal-*4GlcUA; (17) GlcNAcal^4IdUA. 
Galactosyltransf erases, also provided by the invention, 

are associated with the following linkages: (1) Gal/?l-»4Glc; 
(2) Gal01-4GlcNAc; (3) Gal/3l-3GlcNAc; (4) Gal01-6GlcNAc ; (5) 
Gal/Sl-OGalNAc; (6) Gal01-6GalNAc; (7) Gala 1-3 GalNAc; (8) 
Galal->3Gal; (9) Galal-»4Gal; (10) Gal01-*3Gal; (11) Gal/Jl-4Gal; 
(12) Galj9l-6Gal; (13) Gal/Jl-*4xylose; (14) Galj8l-1'- 
sphingosine; (15) Gal01~>l'-ceramide; (16) Gal01->3 diglyceride; 

(17) Ga 10 l-*0-hydroxy lysine; and (18) Gal-S-cysteine. 
N-Acetylgalactosaminyltransferases also provided by the 

invention are associated with the following linkages: (1) 
(GalNAcal-3) [ (Fucal-2) ]Galj8-; (2) GalNAcotl-Ser/Thr; (3) 
GalNAc/Sl->4Gal; (4) GalNAcjS 1-^3 Gal; (5) GalNAca l-O GalNAc ; (6) 
(GalNAc01->4GlcUA01-3) n ; (7) (GalNAc|8l-41dUAal-3-) n ; (8) 
-Man|S-»GalNAc-»GlcNAc->Asn . 

Other glycosyltransf erases , also provided by the 
invention, are associated with the following linkages: 
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TO GalNAc 



Gal/3l-3GalNAc 
Gal/3l-4GalNAc 
Galal-3GalNAc 
GlcNAc/8 1 -3 GalNAc 
GlcNAc01-6GalNAc 



GalNAcal-3GalNAc 
Siaa2 -3 GalNAc 
Siaa2-6GalNAc 



GlcNAc01-3Gal 
GlcNAc/3l-4Gal 
GlcNAc/3l-6Gal 
GlcNAcal-4Gal 
GalNAcal-3Gal 
GalNAc01-3Gal 
GalNAc/3l-4Gal 
Siaa2-3Gal 
Siaa2-6Gal 
To Glc 

Manal-6Glc 



Manal-4Glc 
To GlcNAc 



To Gal 



Gal/?l-3Gal 



Galal-3Gal 



Fucal-2Gal 
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Gal01-4GlcNAc 
Gal01-3GlcNAc 
Fucctl-3GlcNAc 
Fucal-4GlcNAc 
Glcocl-4GlcNAc 
GlcNAcal-4GlcNAc 

Siaa2-4GlcNAc 

To Sia 

Siaa2-8Sia 

To Protein 

GalNAcal-O-Ser/Thr 

Still other glycosyltransf erases provided by the 
invention include : 01 , 3GlcNAc-jSl , 3glucuronyltransf erase , 
glucuronic acid-01 , 4 -N-acety lglucosaminyltransf erase , 
asparagine N-acety lglucasaminyltransf erase , serine 0- 
xylosyltransf erase, xylose 01,4-galactosyltransf erase, 
galactose /3l, 3-galactosyltransf erase, galactose 01,3- 
glucuronyltransf erase, glucuronic acid 01, 4 -N-acety 1- 
galactosaminyltransf erase , N-acety Igalactosamine 01,3- 
glucuronyltransf erase, N-acetylgalactosamine-4- 
sulf otransf erase , N-acety lgalactosamine-6-sulf otransf erase , 
asparagine-0N-acetylglucosaminyltransf erase, serine/threonine- 
otN-acetylgalactosaminyltransf erase, N-acety lglucosamine-0 1,4- 
galactosaminyltransf erase, galactose-01, 3-N- 
acetylglucosaminyltransf erase , N-acetylglucosamine-6- 
sulf otransf erase , galactose-6-sulf otransf erase , glucuronic 
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acid-al , 4 -N-acety lglucosaminy ltransf erase , N-acety lglucosamine 
01 , 4-glucuronyltransf erase , heparin-N-acetyl-glucosamine-N- 
acetyltransf erase , galactose- l , 6-N- 

acetylgalactosyl transferase, heparin-N-acety lglucosamine 
sulf otransf erase , N-acety lglucosamine-ocl , 4-glucoronyl 
epimerase , N-acety lglucosamine-6-sulf otransf erase , N- 
acetylglucosamine-N-sulf otransf erase , glucuronyl-al , 4-N- 
acety lglucosaminy ltransf erase, I durony 1-2 -sulf otransf erase, 
glucuronyl-01 , 4-N-acetylgalactosaminy ltransf erase , and N- 
acetylgalactosamine-al, 3-glucuronyl epimerase. 

These enzymes are associated with the following linkages 
and oligosaccharide structures in connective tissue 
polysaccharides f Rod en, L . "Structure and Metabolism of 
Connective Tissue Proteoglycans," in The Biochemistry of 
Glyc2cograteoproteins and Proteoglycans, wm. Lennarz, ed. pp. 
267-371 Plenum Press, New York, incorporated herewith by 
reference, in particular the tables on pages 269, 270 and 271 
thereof) . 

The above-noted glycosy ltransf erase genes and/ or cDNAs 
are obtained as cloned molecules, or are transferred to host 
cell lines, in accordance with the invention by using the 
post-trans lational property manifest in the cognate and 
appropriate above-noted characteristic linkages, to isolate a 
cell from which the gene or cDNA library is then created, and 
from which the gene or cDNA encoding the glycosyltransf erase 
is isolated. 
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Additional enzymes comprising members of the 
mannosy Itransf erase family include a (1,2) 
mannosy 1 trans f erases , a (1,3) mannosyltransf erases , a ( 1 , 6 ) 
mannosy ltransf erases , and 0(1,4) mannosyltransf erases, 
associated with the construction of the linkages formed in 
asparagine- linked oligosaccharides, as exemplified below (and 
as reviewed in Korn*el4i Ft, ttnfl KQTOfeldi St (1985)) 
"Assembly of asparagine-linked oligosaccharides 91 Annu. Rev. 
Biochem. 54, pp. 631-664.) 



Mm 



Man 



Man 





Man 



Man Man 





Man 



Man 




QlcW 



Gtc&Ac 



Aan 
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Others include ceramide glucosyltransferase and ceramide 
galactosyl transferase, oligosaccharyltransf erase, and O- 
acetylases that o-acetylate N-acetylneuraminic acid (sialic 
acid) o 



Abbreviation s 

Sia; sialic acid XdUAs L-iduronic acid 

Gal; D-galactose GleUA; D-glucuronic acid 

GalNac; D-N-acetylgalactosamine Xyl; D-xylose 

Glc; D«glucose s©r,° serine 

GlcWAc; D-N-acetylglucosamine Thr; Threonine 

Fuc; L-fucose Asn; asparagine — . 
ESan ? D-mannose 



In another embodiment, the present invention provides a 
method for obtaining a soluble or a solid-phase 
oligosaccharide, polysaccharide, lipid, or protein« This 
method comprises contacting an oligosaccharide or 
polysaccharide precursor with a fused protein. The enzyme 
used, provided by the present invention, is either an 
unglycosylated glycosyl transferase or a fused protein which 
comprises two moieties: as the first moiety, at least the 
catalytically functional domain of a glycosyltransf erase ( vide 
infra) ; and, as a second moiety, either a proteinaceous spacer 
attached to the solid support or a proteinaceous component 
comprising an affinity ligando The enzyme of the invention 




WO 97/09421 ^PCT/US96/13816 

-30- 

transforms the precursor into the desired oligosaccharide, 
polysaccharide, glycol ipid, or glycoprotein which is thereby 
obtained. 

A notable advantage of the invention is that it may 
provide, in one embodiment, non-glycosylated 

glycosyl trans f erases. It is thought to be generally true that 
many (if not all) naturally occurring glycosyl transferases are 
glycoproteins. When they are used to produce oligosaccharides 
or polysaccharides from oligosaccharide/polysaccharide 
precursors, the enzymes themselves may be susceptible to 
glycosylation. This (undesired) activity may consume starting 
material and may result in premature loss of enzyme activity. 
The non-glycosylated glycosyltransf erases of the present 
invention do not suffer these salient disadvantages. They may 
be obtained as non-glycosylated enzyme either because they are 
obtained as a product produced in a microorganism deficient in 
the relevant glycosylation mechanism, or in an animal cell in 
which glycosylation of the glycosyltransferase has been 
suppressed. Suppression of the glycosylation of the 
glycosyltransferase in an animal cell is obtained by mutating 
the isolated glycosyltransferase gene using known techniques 
to eliminate the glycosyation sites on the 
glycosyltransferase • 

The non-glycosylated glycosyltransferase of the present 
invention can be a non-glycosylated protein corresponding at 
least to the catalytically functional domain of the 
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glycosyltransf erase ( vide infra ) and up to a non-glycosylated 
protein corresponding to the whole gene encoding the 
glycosyltransf erase . 

In another embodiment, the present invention provides a 
fused protein comprising two moieties as set forth above: as 
a first moiety, at least the catalytically functional domain 
of a glycosyltransf erase; and, as a second moiety, either a 
proteinaceous spacer capable of being attached to a solid 
support or a proteinaceous component comprising an affinity 
ligand. 

Glycosyltransf erases are known to possess three domains 
which correspond to three different areas of the gene encoding 
the enzyme. The area of the gene found at the 3 '-end of the 
gene is known to encode the catalytically functional domain 
( Lowe . Seminars in Cell Biology . (1991) 2:289-307 hereby 
incorporated by reference) . The fused protein of the present 
invention contains at least this catalytically functional 
domain, but it can contain up to the whole protein sequence. 
The protein is produced fused to the second moiety using known 
techniques . 

The second moiety can be used either to anchor the 
catalytically functional domain onto a solid support, or 
permit its recovery by exploiting the presence of a specific 
affinity ligand on the second moiety. For the second moiety, 
the IgG binding domain of Staph . protein A can be used. Such 
a fusion protein can be bound to an IgG-containing solid phase 
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matrix, such as IgG-Sepharose. A number of other alternative 
proteins can be fused to the cat a lytic ally active segments of 
the glycosyl transferase to effect binding to a solid matrix. 
Such proteins, and their respective matrix-associated 
receptors include streptavidin-biotin, IgG heavy chain-protein 
A, and virtually any other known protein or peptide segment 
for which an antibody exists or can be made. 

In another embodiment, the present invention provides a 
method for producing a recombinant glycoprotein, glycolipid or 
free oligosaccharide using, e.g., either an enzyme obtained in 
accordance with the present invention or a recombinant 
organism obtained in accordance with the present invention. 
For example, specific post-translational glycosylation 
capability can be added to a host cell by the following steps: 
first the desired gene or cDNA which has been isolated is 
introduced into a cell by using standard transformation or 
transfection techniques in a manner to obtain an organism 
capable of expression of the transfected cloned gene product; 
the host cell acquires the post-translational capability 
determined by the transfected gene, where the cell did not 
express this capability prior to transfection. Alternatively, 
the approach set forth above is performed, but instead of 
using a single cloned gene that determines post-translational 
capabilities, use is made of uncloned gene segments (high 
molecular weight genomic DNA, for example) or a library of 
cloned genomic DNA fragments or cDNA molecules* Transfected 
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cells generated in this manner are then subjected to selection 
methods based upon detection of a newly-acquired desired post- 
trans la tional capability, to isolate clonal cell lines 
expressing this capability* 

In another embodiment, enzymes obtained in accordance 
with the present invention can be used in an in vitro reaction 
to modify cell-surface oligosaccharide molecules. For 
example, the inventor has purified the blood group A UDP- 
GalNAc transferase and its substrate UDP-GalNAc to convert in 
vitro blood group H oligosaccharide determinants on mouse 
cells to blood group A determinants ( Ernst et ai . J, Biol. 
Chem. (1989) 264:3436-3447). An analogous scheme can be 
employed using enzymes obtained in accordance with the present 



invention, alone, or in conjunction with other available 
glycosyltransf erases and glycohydrolases, to modify existing 



functional eukaryotic or prokaryotic cells, in such a manner 
as to render their cell surface oligosaccharide structures 
relevant for a desired characteristic. 

Such a host cell possessing added specific post- 
translational glycosylation capability is used in accordance 
with known recombinant technology to obtain a glycoprotein, 
glycolipid, or free oligosaccharide. This cell being 
characterized by possessing both the post-translational 
glycosylation capability as well as the capability of 



cell surface oligosaccharide molecules on dead or viable. 
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producing the recombinant glycoprotein, glyco lipid, or free 
oligosaccharide . 

These latter embodiments can be used, for example, to add 
novel oligosaccharide molecules to the surface of specific 
kinds of mammalian cells (cells with specific or general 
immune function, for example) for the purpose of targeting 
them to particular tissues or other locations in the body, for 
a therapeutic or diagnostic use. In particular, such modified 
cells could then be targeted to tissues expressing lectin-like 
cell adhesion molecules that specifically recognize particular 
oligosaccharide structures that have been added to the surface 
of the modified cells. 

In another embodiment, the present invention provides a 
method for suppression of glycosylation activity in a cell. 
In this embodiment, specific pos t- trans la tional glycosylation 
capability is deleted from a host cell. This result can be 
achieved by introducing a specific cloned reagent into a cell 
by standard transformation or transfection techniques, after 
in vitro modifications that (i) inactivate the gene and (ii) 
insert with it or adjacent to it one or more genetically 
selectable markers. Introduction of this modified inactive 
gene effectively replaces the endogenous functional gene via 
homologous recombination, using standard techniques. 

If necessary, two or more rounds of this process can be 
performed to inactivate both wild type genes in a diploid (or 
higher ploidy) organism. The end result is a cell line with 
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(two) non-functional genes not now capable of determining the 
post-translational capability whose elimination was desired. 
Alternatively, the gene obtained in accordance with the 
present invention is introduced to a cell by transformation or 
transfection, in a state in which it is expressed in an anti- 
sense orientation, using standard techniques* This eliminates 
expression of its cognate wild type gene, via standard anti- 
sense expression methods. Treatment of the cell with anti- 
sense synthetic oligonucleotides, whose sequence (s) is (are) 
derived from the gene obtained in accordance with the present 
invention, can also be used to eliminate expression of the 
cognate wild type gene, again via standard methods. 

* 

Alternatively, the gene obtained in accordance with the 

i 

present invention is introduced into a cell by transformation 
or transfection, such that the expression of a new post- 
translational modification prevents or eliminates expression 
of an undesired one. This approach turns on the observation 
that the actions of some glycosyltransf erases on common 
acceptor substrates are mutually exclusive, i.e., 
a(l,2) fucosylation can prevent a(2, 3) siaylation and vice 
versa, or a (1, 3) galactosylation can prevent a (2 ,3) siaylation, 
and vice versa. 

Addition or deletion of cellular post-translational 
capabilities (including glycosylation) allows, for example, 
the generation of host cell lines that can be used to produce 
lipids, proteins, or free oligosaccharides of diagnostic or 
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therapeutic utility, whose specific post-translational 
modifications, including gly cosy lat ion, affect their function. 
For example, recombinant proteins such as tissue plasminogen 
activator or erythropoietin normally exists as glycoproteins. 
Should specific oligosaccharide structures on these 
glycoproteins be shown to have beneficial effects on their 
biosynthesis, serum half life, receptor interaction, or other 
function, the reagents and processes provided by the present 
invention can be used to construct hosts that yield 
recombinant proteins with the specific, and functionally 
optimal, oligosaccharide structures. 

This embodiment can be used, for example, to delete 
specific oligosaccharide molecules from the surface of 
specific kinds of mammalian cells (cells with specific or 
general immune function, for example) for the purpose of 
preventing targeting to their normal, physiologic tissues or 
other locations in the body, and thus allow them to be 
targeted to other non-physiologic targets for therapeutic or 
diagnostic use. In particular, such modified cells can be 
shunted away from tissues where they normally act, to tissues 
expressing lectin-like cell adhesion molecules with 
specificities for other kinds of cells. 

In another embodiment, the present invention provides 
gene products in heretofore unavailable amounts. These gene 
products, glycosyltransf erase enzymes, can be used in 
enzymatic reactors to produce glycoproteins, glycolipids, 
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oligosaccharides or polysaccharides of interest. In this 
embodiment, cloned glycosyltransf erase gene segments can be 
used with standard recombinant protein expression systems to 
generate large amounts of the enzyme encoded by the gene. 
These enzymes can be used in bioreactors in in vitro , large 
scale, synthesis of oligosaccharides or glycolipids, or for 
glycosidic modification of proteins and glycoproteins. 

Acceptor oligosaccharides in such a scheme can be derived 
from any of the following: 

(a) commercially available mono-, di- or higher order 
saccharides prepared from natural sources, or by chemical 
synthesis; 

(b) di- or higher order oligosaccharides produced In 
vitro by other recombinant enzymes generated by this process; 



(c) di- or higher order oligosaccharides produced by or 
purified from cell lines whose post-trans la tional capabilities 
have been engineered as described above. 

In this embodiment, two in vitro bioreactor-type 
approaches can be used. In one embodiment, an oligosaccharide 
acceptor and nucleotide sugar substrate are introduced into 
the reactor containing a solid phase matrix to which is bound 
catalytically active glycosyltransf erase. This matrix can be 
generated using the fusion protein noted above which comprises 
a catalytically active moiety, as a soluble segment of the 
glycosyltransf erase, fused to a protein segment that can be 
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used to bind the fusion protein to a solid phase matrix. A 
specific example of such a fusion protein is a catalytically 
active segment of the mouse a (1,3) galactosyl transferase, 
fused to a segment of the IgG binding domain of Staph . protein 

A (jLansen et al, Proc. Natl, Acad. Sci. fUSAl . 86, 8227-8231, 
1989) . 

Acceptor and nucleotide sugar substrates are incubated in 
such a reactor, at an appropriate temperature, pH f and other 
known conditions, for a length of time sufficient to yield a 
desired amount of oligosaccharide or polysaccharide product. 
The product is then harvested by using known techniques. 

In the variation, the nucleotide sugar substrate and 
soluble glycosyl transferase catalytic domain-containing fusion 
protein is introduced into a reactor containing the 
oligosaccharide acceptor molecule covalently attached (i.e., 
immobilized) to a solid phase matrix. Attachment is carried 
out using known techniques in such a manner as to make 
available to the reaction medium the portion of the 
oligosaccharide acceptor molecule that will be enzymatically 
modified. 

The present invention provides a method for generating 
animal cell lines with specific capabilities for post- 
translational modification of proteins produced by them, as 
well as a method to isolate cloned genes, cloned complementary 
cDNAs, and their mRNAs, that determine the expression or 
biosynthesis of the enzymes responsible for post-translational 
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modification of eukaryotic (e.g. animal, such as mammalian) 
proteins, especially (but not limited to) those post- 



animal, such as mammalian) glycoconjugates, without the need 
to first isolate the protein products of these genes. This 
includes cloned genes, cloned complementary cDNAs , and their 
mRNAs, that encode eukaryotic (e.g. animal, such as mammalian) 



glycosylation and sulfation, as well as phosphorylation, 
methylation, fatty acylation, and removal of glycosyl 
modification (glycohydrolases) . 

The uses of the present invention thus include the 
following: 



(i.) Construction of animal cell lines with specific 
post-translational capabilities (for the production of 
diagnostics and therapeutics) . 

This method can be used to construct animal cell lines 
that are suitable host cells for the production of diagnostic 
or therapeutic material whose usefulness or efficacy depends 
upon specific post-translational modifications. For example, 
the biological effectiveness of most therapeutic proteins or 
peptides, recombinant or otherwise, often depends critically 
upon the oligosaccharide structure (s) that are covalently 
attached to them. The structure of these oligosaccharides is 
primarily a function of the number and kind of 



translational processes that construct eukaryotic (e.g. 



enzymes that post-translationally modify proteins by 
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glycosyl transferase enzymes that are found in the cell used to 
produce these therapeutic products. 

Animal cells and yeasts are competent to perform these 
glycosylation reactions; however, not all glycosyltransf erase 



therefore, some oligosaccharide structures (including sialyl 



Lewis X and/or sialyl Lewis a determinants, and/or their non- 
sialylated variants generated by the enzymes encoded by the 
present DNA sequences) are not produced by them. The converse 
is also true, namely, that the producing cell may express some 
glycosyltransf erases that create oligosaccharide structures 
which prevent effective bioactivity. The present invention 
provides for the creation or elimination of specific 
glycosyltransf erases capabilities in producing cells, so that 
therapeutic effectiveness of products made by the cells may be 
optimized. 

It is likely that sialyl Lewis x and/or sialyl Lewis a 
determinants, and/or their non-sialylated variants, may alter 
the bioactivity (for better or for worse) of natural or 
recombinant therapeutic or diagnostic agents (glycoproteins or 
glycolipids) produced by mammalian or other eukaryotic hosts. 
Eukaryotic host cells that the biotechnology industry uses to 
produce these recombinant agents may be altered with the DNA 
sequence information and related information described in this 
invention, to add sialyl Lewis x and/or sialyl Lewis a 
determinants, and/or their non-sialylated variants, to the 



enzymes are produced by every animal cell or yeast, and 
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oligosaccharides on recombinant product by expressing one or 
more of the chimeric enzymes described here in the desired 
host. 

The old methods used for this process include an 
empirical approach to identify a cell line most appropriate 
for the production of the recombinant or natural product* 
This is generally not optimal since cell lines with suitable 
post -translation modification capabilities may not exist 
naturally, or may not be especially suited to high level 
production of an appropriately modified product. 
Alternatively, unwanted post-trans lational modifications 
present on a therapeutic material produced by an empirically 
identified animal cell line can be removed chemically or 
enzymatically, a process that may be costly or inefficient, or 
both. 

The advantages of the present methods over the older 
methods include the ability to construct cell lines with 
specific post-translational modification capabilities; 
properly constructed, these cell lines eliminate any need for 
chemical or enzymatic treatment of a therapeutic or diagnostic 
material to remove unwanted post-translational modifications. 
Moreover, cell lines with inappropriate post-translational 
modification capabilities, but that are otherwise excellent 
cells for production, may be modified to achieve correct post- 
translational modification of the product. 
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This method allows the construction of animal cell lines 
with post-trans lational modification capabilities precisely 
tailored to the specific needs of a particular diagnostic or 
therapeutic product produced by animal cells. 

(ii.) Isolation of reagents suitable for efficient 
enzymatic synthesis and production of oligosaccharides (in 
enzyme reactors, for example) . 

Oligosaccharides can have therapeutic utility as 
immunomodulatory reagents in the field of organ 
transplantation. In particular, soluble and solid-phase 
oligosaccharides may find use as therapeutic agents with which 
to block or ameliorate antibody-mediated organ transplant 
rejection in cases involving incompatibility due to 
differences in the major blood group antigen systems of the 
organ donor and the recipient. Similarly, soluble 
oligosaccharides can find use as therapeutic agents that 
function by blocking attachment of bacterial, viral, or 
parasitic pathogens to glycocon jugate receptors found on the 
surface of the animal tissues that these pathogens invade. 
For example there is evidence that portions of the Lewis blood 
group oligosaccharide antigens serve as "receptors 11 for some 
forms of uropathogenic bacteria. 

Moreover, glycocon jugates have been implicated in 
modulating adhesive events between cells and between cells and 
their environment during developmental and differentiation 
processes. These events included binding of spermatozoa to 
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eggs, and the initial events that mediate attachment of 
fertilized ova to the uterine vail at the beginning of 
implantation. These observations show, for example, the 
possibility that contraceptive uses for (biologically 
"natural") oligosaccharide molecules exist. 

In addition, specific glycoconjugates containing sialyl 
Levis x determinants have been implicated as ligands for the 
LECCAM/ Select in family of adhesion molecules, that play 
important roles in mediating adhesion betveen cells of the 
immune system, and some tumor cells, and the surfaces of the 
endothelial cells that line the vascular tree. These 
observations suggest the possibility that the cloned 
fucosyltransf erase sequence described here could be used to 
construct oligosaccharide-type molecules, with pharmaceutical 
properties possessing anti-inflammatory and anti-tumor 
metastatic functions. 

Currently, oligosaccharides of defined structure are 
produced by chemical synthesis (a procedure that is 
inefficient and costly) or by isolation from natural sources 
(using costly and inefficient procedures that often require 
the processing of large quantities of animal or plant 
material, and the purification of the desired oligosaccharide 
from other contaminating oligosaccharides) or by 
glycosyltransferase-assisted synthetic procedures, using other 
recombinant a ( 1 , 3 / 1 , 4 ) fucosyltransf erases . 
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The present invention provides a mechanism for the 
isolation of cloned glycosyl transferase genetic sequences, 
which in turn can be used to economically synthesize abundant 



quantities of purified glycosyltransf erase enzymes. These can 
be used to construct enzyme bioreactors (enzymes in solution 
or immobilized on a solid phase matrix) capable of enzymatic 



synthesis of these structures* 

This is more efficient than approaches involving the 
chemical synthesis of oligosaccharides or their purification 
from natural sources, for a variety of reasons. One, the only 
chemicals necessary are the enzyme substrates; most of these 
are easily obtained or synthesized. Two, enzymatic synthesis 
produces only the desired product and the nucleotide 



hydrolysis. These latter two chemicals are found as the 
natural byproducts of these reactions in animal cells, are 
essentially non-toxic, and may be easily separated from the 
oligosaccharide synthetic product. 

By contrast, chemical synthetic procedures typically 
generate numerous products of side reactions which must be 
removed, and which may be toxic as well. Similarly, 
purification of oligosaccharides from natural sources requires 
the removal of other contaminating oligosaccharides present in 
the natural material. 

Three, enzymatic catalysis is extraordinarily efficient; 
virtually complete conversion of substrate to product can be 



monophosphate or nucleotide diphosphate product of substrate 
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achieved. By contrast, chemical synthesis of these structures 
is a multi-step process; yields at each step may be much much 
less than 100%, and the cumulative efficiency of current 
chemical synthesis procedures does not approach the efficiency 
possible with enzymatic synthesis. Similarly, purif ication of 
oligosaccharides from natural materials can entail significant 
losses inherent to the purification procedures required to 
separate the desired oligosaccharide from contaminating, 
irrelevant and/ or undesirable oligosaccharides, with 
inefficient isolation of the desired oligosaccharide. 

Although glycosyltransf erases for synthetic use may be 
purified from animal tissues, these purifications are 
themselves inefficient, primarily because the enzymes are 
typically present in very low abundance. The present 
invention provides two mechanisms that provide for the 
abundant production of these enzymes. 

First, this can be done through the construction and 
selection of animal cells that produce relatively large 
quantities of the enzymes. Alternatively, this invention 
provides a mechanism to isolate cloned cDNAs encoding these 
enzymes, or to construct synthetic genes that encode these 
enzymes via information derived from such cloned cDNAs or 
genes. These cloned nucleic acid sequences can then be used 
with standard recombinant DNA technologies to produce large 
quantities of glycosyltransf erases. 
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(iii.) Isolation of reagents suitable for producing 
recombinant glycosyltransf erases to be used directly as 
research reagents, or to be used to generate anti- 



glycosyl transferase antibodies for research applications. 

The present invention provides two mechanisms for 
producing large quantities of these enzymes (see (ii.) above - 
i.e., specially constructed animal cells, or via natural or 
synthetic genes encoding these enzymes) which may be used as 
research tools with which to study the structures and 
functions of oligosaccharides and glycoproteins. Likewise, 
the enzymes produced by this method, or the nucleic acid 
sequence and derived protein sequence provided by this method, 
may be used to generate antibodies to these enzymes (via 



immunization with synthetic peptides whose sequences are 
derived from the cloned enzyme cDNAs or genes, or by direct 
immunization with the recombinant enzymes) . These antibodies 
can also be used as research reagents to study the 
biosynthesis and processing of these enzymes, and can be used 
as an aid in their purification for all the uses described in 
this disclosure. 

(iv.) Antibodies to glycosyltransf erases as diagnostic 
reagents . 

Some of these glycosyltransf erases have been implicated 
as tumor markers in body fluids. The enzymes have typically 
been assayed in these fluids by activity assays, which may be 
subject to non-specificity due to competing 
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glycosyltransf erase activity. These assays may also be 
insensitive since it is possible that inactive enzymes might 
be useful as tumor markers but would not be detected by enzyme 



antibodies to these enzymes {monoclonal and polyclonal 
antibodies against synthetic peptides constructed from 
information derived from cloned glycosyltransferase cDNAs or 
genes, against enzymes produced by recombinant 
glycosyltransf erases, or against enzymes produced by animal 
cells constructed by this method) . Anti-glycosyltransf erase 
antibodies specific for particular glycosyltransf erases can be 
produced by this means, and can be used to detect and 
quant itate glycosyltransf erases in body fluids with 
specificity and sensitivity exceeding enzyme activity assays. 

(v.) Engineering of glycosyltransferase substrate 
specificity to generate novel glycocon jugate structures on 
secreted or cell-associated glycocon jugates. 

The present invention provides reagents (cloned 
glycosyltransferase genes or cDNAs) and genetic selection 
methods that, when used with appropriate known mutagenesis 
schemes, allow the generation of mutant glycosyltransf erases 
that generate glycosidic linkages different from that 
generated by the wild-type enzyme. These novel linkages may 
or may not be naturally occurring, and find utility as 
moieties that enhance bioactivity of the molecules to which 



activity assays. 



The present invention provides a mechanism for generating 
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they are attached. Alternatively, mutagenesis and selection 
approaches can be used to generate mutant enzymes that act in 
a dominant negative fashion. The dominant negative mutants so 
generated can be used to inactivate endogenous 
gly cosy Itransf erase activities when the product (s) of such an 
enzyme are not desired. 

This invention allows the isolation of 
gly cosy Itransf erase genes (as well as genes that direct the 
synthesis of enzymes that perform the post -trans lational 
modifications) by methods designed to identify the surface- 
expressed product of the enzyme, and without the need to 
purify the enzyme as is required of standard molecular cloning 
procedures (i.e., without any information about the primary 
structure of the enzyme, and without antibodies directed 
against the enzyme) • 

A consequence of one implementation of this method is the 
generation of cells with specific capabilities for 
glycosylation. One version of the detailed implementation of 
this method is described in the following publications by the 
inventor, J. Biol. Chem. (1989) 264(6) : 3436-3447 and J. Biol. 
Chem. (1989) 264(19) : 11158-11167, both of which are herein 
incorporated by reference. 

In outline, this version of the method entails the 
generation of cultured animal cell lines with specific 
abilities to construct desired glycocon jugate structures, by 
introducing exogenous genetic material into cells that do not 
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express the desired glycosyltransf erase or its product, using 
genetic material from cells that do express the desired 
enzyme. A positive selection procedure is then employed to 
identify transfected cells that express the enzyme product on 
the surface of the cell. The transfected genetic sequences 
responsible for this new phenotype are then isolated by 
standard procedures involving gene library construction and 
nucleic hybridization. This method allows the isolation of 
the genetic material determining expression of the 
glycosyltransf erase without the need to purify the enzyme. 

Although detection and isolation of these sequences by 
hybridization procedures involving a dispersed and repetitive 
human DNA sequence (Alu) is used to illustrate isolation of 
the gene, other methods may be used to "tag" transfected 
sequences,, including but not limited to the ligation to the 
transfected DNA of DNA markers that allow identification and 
isolation of the desired genes by nucleic acid hybridization 
or genetic selection (supF or G418 resistance "Neo" sequences, 
for example) procedures. Three methods for the selection of 
transfected cells with the appropriate phenotype, flow 
cytometry, "rosetting", and "panning 11 , are described in 
Examples I, II, and III. Although an antibody specific for 
the enzyme product was used in the examples, other non- 
antibody reagents that specifically recognize surface 
expressed enzyme products may also be employed, including 
plant and animal lectins. 
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The enzymes provided by the present invention are imbued 
with certain unique characteristics, as compared to the 
corresponding native enzyme. Naturally-derived 
glycosyl transferases have been purified, with certain claims 
being made to the homogeneity of the product obtained. 
Nonetheless, such claims of homogeneity have been made based 
upon analyses of the preparations by SDS-polyacrylamide gel 
electrophoresis methods. In the older literature (i.e., pre- 
1982) the homogeneous enzyme was identified in the gel by 
Coomassie blue staining, or other staining methods, that are 
notably less sensitive than contemporary silver staining 
approaches. It is thus almost certain that such preparations 
were less than homogeneous. 

In more contemporary literature, three 
glycosyltransf erases have been analyzed by silver staining 
methods (i.e., rat sialyl-T, GlcNAc-T-I and GlcNAc-T-II) . 
These appear to be virtually free of contaminant proteins. 
Nonetheless the small amounts of final pure proteins obtained 
using these purification procedures were analyzed using the 
sensitive silver staining method, which is not sufficiently 
sensitive to detect levels of contaminants of roughly 5 to 10 
wt.%, in the small amounts of pure protein available. Thus, 
prior to the present invention, glycosyltransf erases having a 
level of purity of at least 95 wt.%, preferably at least 98 
wt.% were not available. The present recombinant 
glycosyltransferases which are obtained using cloned 
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glycosyltransf erase DNA sequences, in large amounts, in 
soluble form, or fused to an aff inity-purif iable protein 
segment, can be obtained in a truly, heretofore unavailable, 
homogeneous state. 

The proteins provided by the present invention, as noted 
above, may also be distinguished from heretofore available 
proteins by the fact that they can be made in a non- 
glycosylated form. Many, if not all naturally-derived 
glycosyltransf erases are glycoproteins, that is, they contain 
themselves one or more N-linked and/or 0-1 inked 
oligosaccharide structures. These structures can themselves 
be glycosylated by the enzyme itself in an enzymatic reactor, 
for example, and this represents a competing acceptor 
substrate which could reduce the efficiency of the reaction 
and contribute to premature enzymatic activity loss. This 
"autoglycosylation" phenoma has the potential of either 
inactivating or reducing the catalytic efficiency of the 
enzyme and/ or bi ore act or. 

Cloned glycosyltransf erases provide a way to circumvent 
this problem. Firstly, expression of cloned 

glycosyltransferases in a bacteria host, such as £. coli , that 
is incapable of glycosylating these enzymes, will yield large 
amounts of non-glycosylated glycosyltransferases. These 
recombinant proteins can be used in a bioreactor, and since 
they are not themselves glycosylated, may be superior in 
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performance to the naturally derived, glycosylated enzymes for 
the reasons noted above. 

Alternatively, if it is necessary to express these 
enzymes in an eukaryotic cell host that is capable of 
glycosylating the recombinant enzyme, standard site-directed 
mutagenesis approaches can be used to eliminate from the 
recombinant protein the amino acid signals that allow animal 
cells to glycosylate the enzymes. These known signals include 
certain asparagine residues, falling within the N-X-T or N-X-S 
motif that allows asparagine -linked glycosylation, and also 
includes some serine and threonine residues that are 
substrates for 0-1 inked glycosylation. 

Standard mutagenesis methods can be used to alter the DMA 
sequence encoding the glycosyltransf erase to either delete the 
codon that encodes these N, S or T residues, or change the 
respective codon to a codon that determines an amino acid with 
similar physical properties, but that is incapable of 
supporting N-linked or 0-linked glycosylation. 

The present invention also provides unique mutant 
recombinant glycosyltransf erases. Isolation and expression of 
glycosyltransf erase genes and cONAs offers the opportunity to 
generate mutant glycosyltransf erases with properties superior 
to the fixed properties inherent in the naturally occurring 
enzymes. Standard techniques based upon site-directed, or 
random mutagenesis, can be used to obtain mutant 
glycosyltransf erases with some of the illustrative properties: 
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(1) Minimal catalytic domain: progressive deletion of 
amino acids from the glycosyl transferase protein can be 
achieved, and the resulting mutant glycosyltransf erases can be 
tested for activity. Based upon known functions for different 
parts of these molecules, it can be predicted that a 
catalytically active mutant glycosyltransf erase can be 
produced that is (a) soluble (lacks transmembrane segment on 
natural glycosyltransf erases that render them insoluble and 
thus unsuitable for bioreactors) , and (b) much smaller than 
the natural glycosyltransf erase (which retains transmembrane 
segment and the "stem" region, neither of which are necessary 
for catalytic activity) . 

On a protein mass basis, small catalytically-active 
domains derived from mutated glycosyltransf erase genes or 
cDNAs represent more catalytic activity than the larger, 
naturally occurring glycosyltransf erases that carry along non- 
catalytically active transmembrane and/or stem region protein 
"baggage. " Thus, the recombinant mutant-derived catalytic 
domain is much more efficient for use in vitro synthesis of 
oligosaccharides, and by a reactor for example. 



Approaches to amplification of mRNA for glycosyltransf erases : 
The cell line used as a source of genetic material (mRNA 
for cDNA library construction, or genomic DNA for genomic 
library construction or genomic DNA transfection) for gene 
transfer to isolate a glycosyltransferase gene can be 
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manipulated to optimize this process. Selection can be 
applied to this cell line to enhance steady state levels of 
the glycosyltransf erase mRNA, and/or amplify its respective 
gene so that multiple copies of it exist in this donor cell. 
This can be done by subjecting the cell line (either after or 
without chemical , radiation, or other mutagenesis method) to a 
selection procedure that selects a variant of the cell that 
expresses higher amounts of the glycosyltransf erase 
oligosaccharide product, at the surface of the cell, for 
example. This type of approach is illustrated in Example II. 

Increased numbers of the oligosaccharide product 
molecules correlate with increased numbers of the cognate 
glycosyltransf erase (s) enzyme molecules within the cell, and 
with an increase in steady state levels of the 
glycosyltransf erase mRNA. Higher levels of this 
glycosyltransf erase mRNA means that more copies of the 
' respective cDNA will be present in a cDNA library prepared 
from the high-expression variant cell line, and thus will 
increase the likelihood of rescuing these glycosyltransf erase 
cDNAs from the library. In some cases, higher levels of the 
specific mRNA can be associated with an increase in the number 
of copies of the cognate glycosyltransf erase gene. Since such 
an amplified glycosyltransf erase gene is more abundant in the 
cell's genome than other irrelevant genes, and more abundant 
than in a parental, non-selected cell line, it is easier to 
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isolate by genomic DNA library or genomic DNA transfection 
approaches. 

It can be shown by transfection studies that expression 
of some oncogenes can increase expression of some 
gly cosy Itransf erases. Thus a cell line can be modified by 
transfection with one or more oncogenes, using standard 
transfection methods, and readily available vectors containing 
oncogenes, and resultant trans fected clones can be assayed for 
increased glycosyltransf erase levels. Such clones can also be 
identified or selected for by FACS or lectin selection methods 
outlined in Examples I, II, and III below. These clones can 
then be used for cDNA library preparation as noted above. 

A number of chemical reagents have been shown to induce 
expression of, or increase expression of, 

gly cosy Itransf erases, in cell lines. This may be associated 
with in vitro differentiation of the cell line, and such 
agents include retinoic acid, as well as dimethylsulphoxide- 
induced differentiation of human and mouse hematopoietic 
precursors, with concomittant increases in the expression of 
some glycosyltransf erases. This occurs because of an increase 
in the steady state level of mRNA for the glycosyltransferase 
in question, and can be used to enhance the ability to isolate 
a cognate cloned cDNA using the cDNA library-mediated 
transfection approach (as shown in Example II, below) . 

An alternative approach for the isolation of genes or 
cloned cDNAs that encode animal glycosyltransf erases (or other 
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post-translational modification enzymes) , by detecting the 
enzyme product at the cell surface, and without the need to 
purify the enzyme, is as follows: cDNA libraries are 
constructed in a plasmid or phage vector that will express the 
cloned cDNAs in a mammalian or yeast host, using mRNA prepared 
from cells or tissue that expressed the desired enzyme. This 
cDNA library is then screened for the desired cDNA by 
introducing the library into a host cell line that does not 
express significant amounts of the enzyme, nor its surface- 
expressed product, but that does have the necessary enzyme 
substrate molecules, and which is capable of displaying the 
enzyme's oligosaccharide product on its surface. The host 
cells which have taken up the cDNA library are subjected to 
selection for cells that contain the desired cDNA and thus 
express the new corresponding oligosaccharide product, using 
flow cytometry, rosetting, or panning, and a reagent specific 
for the enzyme's oligosaccharide product. Cloned cDNAs that 
direct the expression of the desired enzyme may then be 
isolated from the selected cells by standard methods. 

This approach may be used with the following techniques: 

1. Stable transfection into animal cells, selection, 
followed by rescue of the desired clone cDNA by nucleic acid 
hybridization procedures, or by the COS cell fusion technique, 
depending on the vector used. 

2. Transient transfection into COS or WOP cells, 
selection, followed by rescue of the desired clone cDNAs, by 
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the method of Seed . Proc. Nat'l. Acad. Sci. (USA) (1987) 
8£: 3365-3369, or similar methods that make use of cDNA cloning 
vectors that replicate as episomes in mammalian cells (i.e. 
Maraolskee et al. Mol. Cell, Biol.. (1988) 1:2837-2847. 

3. Transformation of yeast cells, selection, followed by 
rescue of the desired cloned cDNA by nucleic acid 
hybridization procedures. 

In addition, the mammalian cDNA expression libraries may 
also be screened by the sib selection method, using an enzyme 
assay to detect pools of cDNA molecules that contain cloned 
cDNAs encoding the relevant glycosyltransf erase (or other 
enzyme involved in post-translational modification) . 
Specifically, a cDNA library is constructed in a mammalian 
expression vector (plasmid or phage) , using mRNA prepared from 
a cell that expresses the appropriate glycosyltransf erase. 

This library is then screened by first dividing it into 
pools of bacteria cells containing clones, each pool 
representing some fraction of the library, but the pools in 
aggregate representing the entire library. A portion of each 
pool is stored, and the remainder (containing sibs of the 
stored clones) is processed for cDNA-vector DNA molecules 
(i.e. plasmid or phage DNAs) . The DNA prepared from each pool 
is separately introduced into one of the appropriate host 
cells described above (see 1, 2, and 3), and, after allowing 
for an appropriate expression time, extracts are prepared from 
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the trans fee ted or transformed host cells and these extracts 
are assayed for the appropriate glycosyltransf erase activity* 

The smallest pool that is thus found to contain plasmids 
directing the synthesis of the appropriate enzyme is retrieved 
from storage and subdivided. Again, a representative portion 
of these pools is stored, and the remainder (again containing 
sibs of the stored clones) of each is processed for plasmid 
DNA, transfection or transformation, expression, extract 
preparation, and enzyme assay. This process is repeated until 
single clones are isolated that direct expression of the 
relevant glycosyltransf erase. Thus, this process does not 
rely upon surface expression of the enzyme product to isolate 
the appropriate cloned cDNA or gene. A version of this 
approach is presented in Example III. 

The procedure used in the present invention need not be 
restricted by the genetic complement of the host; the gene 
transfer aspect of this invention allows expression of genes 
not normally expressed or even present in the recipient cell. 
Although the present text specifically illustrates application 
to glycosyltransf erases, it may be applied to the enzymes and 
genes that control other forms of post-trans lational 
modification, including sulfation, phosphorylation, 
methylation, fatty acylation, and removal of glycosyl 
modification (glycohydrolases) . 

The method described to this point involves isolation of 
glycosyltransf erase genes or cDNAs by selection for a dominant 
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glycosylation trait. The transient expression system 
described for use in COS or WOP cells can also be used to 
identify and clone cDNAs homologous to glycosyltransf erase 
transcripts already present in the COS or WOP host. 

Specifically, cloned cDNAs transcribed in the "anti- 
sense" orientation may eliminate expression of the cognate 
glycosyltransf erase in the COS or WOP host, resulting in a 
recessive glycosylation trait. These DNA sequences can then 
be isolated by selection for surface expression of the 
oligosaccharide linkage recognized by the glycosyltransf erase 
whose expression was eliminated, again by the procedures 
described below (flow cytometry, "rosetting", and "panning"), 
as detailed in Examples I, II, and III. Alternatively, the 
sib selection approach might be used to identify cloned cDNA 
molecules that decrease or eliminate the expression of an 
endogenous glycosyltransf erase, as determined by enzyme 
assays • 

The DNA sequences and corresponding glycosyltransf erases 
of the present invention are summarized in the following . 
Table . 
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SEQ ID NO:l encodes a protein sequence termed Fuc-TIII 
capable of functioning as a GDP-Fuc: [0-D-Gal(l,4/l,3) ]-D- 
GlcNAc(/Glc) a (1, 3/1, 4) -fucosyl transferase. This protein is 
an enzyme that can be used to construct the oligosaccharide 
"ligand" for Endothelial Leukocyte Adhesion Molecule- 1 (ELAN- 

1) that is disclosed in Applicant's co-pending U.S. Patent 
Application Serial No. 07/603,018, filed October 25, 1990, 
which is hereby incorporated by reference. This ligand is the 
sialyl-Levis x molecule. Also, this enzyme, when expressed by 
the cloned DNA sequence described here, functions within 
mammalian cells to generate de novo expression of specific 
cell surface glycocon jugate structures on those cells. These 
structures are recognized by antibodies against the following 
cell surface glycocon jugate structures (See Fig. 8 and Table 

2) . 



.-1 or Lewis x Gal0(l,4) [Fucot(l,3) ]GlcNAc 

sialyl -Lewis x NeuAca(2,3)Gal/?(l, 4) [Fuca (1, 3) ] GlcNAc 

Lewis a Gal0(l, 3) [Fuca(l,4) ]GlcNAc 

sialyl-Levis a NeuAca ( 2 , 3 ) GaljB ( 1 , 3 ) [ Fuca (1,4)] GlcNAc . 

In the above DNA sequence; (I) , the sequence corresponding 
from amino acid position 43 to amino acid position 361 is 
functional, but a larger sequence of up to the whole sequence 
shown can be used. 

This enzyme, when expressed by the cloned DNA sequence 
described here, functions in the enzymatic manner indicated in 



# 
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its name, when assayed in extracts prepared from cells that 
express the DNA sequence (See Table 2) . The oligosaccharide 
product of this enzyme represents fucose linked in alpha 1,3 
configuration to neutral or a (2, 3) sialylated "type II" 
acceptors, or fucose linked in alpha 1,4 configuration to 
neutral or a (2 ,3) sialylated "type I 9 ' acceptors as shown 
below: 



l-1 or Lewis x Gal/3(1,4) [Fuca(l,3) ]GlcNAc 

sialy 1-Lewis x NeuAcos ( 2 , 3 ) Gal0 ( 1 , 4 ) [ Fuca (1,3)] GlcNAc 

Lewis y Fuca(l,2)Galj3(l,4) [Fuccr(l,3) ]GlcNAc 

Lewis a Galj8(l, 3) [Fuca (1,4) ]GlcNAc 

sialy 1-Lewis a NeuAca (2,3) Gal/3 ( 1 , 3 ) [ Fuca (1,4) GlcNAc 

Lewis b Fucct(l,2)Galj3(l,3) [Fuco(l, 4) ]GlcNAc. 

Throughout the remainder of this text, these products will be 
referred to as sub-terminal a (1,3) and a (1,4) fucose residues. 

The catalytic domain of this enzyme has also been 
localized by expression studies. The enzymatic properties of 
the enzyme encoded by this cDNA, and chromosomal localization 
studies, indicate that this cDNA is the product of the human 
Lewis blood group locus. 

This DNA sequence and the corresponding protein have the 
following uses: 

(i.) Construction of animal cell lines with specific 
capabilities with respect to post-translational modification 
of the oligosaccharides on cell-surface, intracellular, or 
secreted proteins or lipids by sub-terminal a (1,3) and a (1,4) 
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fucose residues that represent the products of this enzyme 
(for the production of diagnostics and therapeutics) . 

Specifically, the present cloned DNA sequence can be 
introduced by standard technologies into a mammalian cell line 
that does not normally express the cognate enzyme or its 
product (sub-terminal a (1,3) and a (1,4) fucose residues on 
oligosaccharides) , and transcribed in that cell in the "sense 91 
direction, to yield a cell line capable of expressing sub- 
terminal a (1,3) and a (1,4) fucose residues on oligosaccharides 
on cell-surface, intracellular, or secreted glycoproteins or 
lipids . 

Alternatively, this cloned DNA sequence may be introduced 
by standard technologies into a mammalian cell line that does 



express the cognate enzyme and its product (sub-terminal 
a (1,3) and a (1,4) fucose residues), and transcribed in that 
cell in the "anti-sense" direction, to yield a cell line 
incapable of expressing sub-terminal a(l,3) and a (1,4) fucose 
residues on cell-surface, intracellular, or secreted glyco- 
proteins or lipids. 

Alternatively , the endogenous GDP-Fuc : [ 0-D-Gal (1,4/1,3)]- 
D-GlcNAc(/Glc)a(l,3/l,4)-fucosyltransferase gene(s) , in a 
mammalian cell expressing the cognate enzyme (s), can be 
inactivated with the DNA sequence described here by homologous 
recombination techniques, or by "anti-sense" gene expression 
or oligonucleotide approaches based upon the DNA sequence 
described herein, or by dominant negative mutant 
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fucosyltransf erase sequences that inactivate endogenous GDP 



pu C: [£-D-Gal(l, 4/1,3) ]-D-GlcNAc(/Glc) a (1, 3/1,4) - 
fucosyltransferase(s) and that may be derived via mutagenesis 
and genetic selection schemes, in conjunction with the 
sequence information in this text. 

This method can be used to construct animal cell lines 
that are suitable host cells for the production of diagnostic 
or therapeutic materials whose usefulness or efficacy depends 
upon the specific post-translational modification determined 
by this cloned DNA sequence and its cognate enzyme. For 
example, it is known that the biological effectiveness of many 
therapeutic proteins or peptides, recombinant or otherwise, 
may depend critically upon the oligosaccharide structure (s) 
that are covalently attached to them. The structure of these 
oligosaccharides is primarily a function of the number and 
kind of glycosyltransferase enzymes that are found in the cell 
used to produce these therapeutic products. 

Animal cells and yeasts are competent to perform these 
glycosylation reactions; however, not all glycosyltransferase 
enzymes are produced by every animal cell or yeast, and 
therefore, some oligosaccharide structures (including sub- 
terminal a (1,3) and a (1,4) fucose residues generated by the 
enzyme encoded by the DNA sequence described here) are not 

produced by them. 

The converse is also true, namely, that producing cells 
may express a glycosyltransferase analagous to, or identical 
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to, the GDP-Fuc: [0-D-Gal(l,4/l,3) ]-D-GlcNAc(/Glc)a(l,3/l,4)- 
Fucosyltransf erase encoded by the DNA sequence described here. 
It is likely that sub-terminal a (1,3) and a (1,4) fucose 
residues alter the bioactivity (for better or for worse) of 
natural or recombinant therapeutic or diagnostic agents 
(glycoproteins or glycolipids) produced by mammalian or other 
eukaryotic hosts* Eukaryotic host cells that are used to 
produce these recombinant agents can be altered with the DNA 
sequence information and related information described in this 
invention, to add subterminal a (1,3) and a (1,4) fucose 
residues to the oligosaccharides on recombinant product by 
expressing all or part of the cloned sequences described here 
in the desired host. Alternatively, sub-terminal a (1,3) and 
a (1,4) fucose residues may be eliminated from the product 
produced in these host cells by the use of transfected "anti- 
sense" vector constructs, recombination-based gene 
inactivation, "anti-sense" oligonucleotide approaches, or 
dominant negative mutant fucosyltransf erases, outlined above. 

The old "methods" used for this process include an 
empirical approach to identify a cell line that does or does 
not express this particular enzyme or an enzyme that functions 
in a similar or identical manner, for the production of the 
appropriately modified recombinant or natural product. This 
is not always optimal since cell lines with this particular 
post-translation modification capabilities may not exist 
naturally, or may not be especially suited to high level 
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production of an appropriately modified product. 
Alternatively, unwanted sub-terminal a (1,3) and a (1,4) fucose 
residues present on a therapeutic material produced by an 
empirically identified animal cell line must be removed 
chemically or enzymatically, a process that may be costly or 
inefficient. 

The advantages of using the cloned, functional DNA 
sequence described here in conjuntion with the technologies 
outlined above, relative to these older methods, include the 
ability to construct lines that specifically lack the 
capability to generate sub-terminal a (1,3) and a (1,4) fucose 
residues on the oligosaccharides of glycoproteins and 
glycolipids; properly constructed, these cell lines will 
eliminate any need for chemical or enzymatic treatment of a 
therapeutic or diagnostic material to remove unwanted sub- 
terminal a (1,3) and a (1,4) fucose residues. Moreover, in the 
event that sub-terminal a (1,3) and a (1,4) fucose residues are 
found to be desirable for a particular diagnostic or 
therapeutic product produced by animal cells, cell lines may 
be engineered with the cloned DNA sequence described here to 
generate these residues. 

(ii.) Isolation of reagents suitable for efficient 
enzymatic synthesis and production of oligosaccharides (in 
enzyme reactors, for example) • 

Oligosaccharides have therapeutic utility as 
immunomodulatory reagents in the field of organ 
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transp latitat ion. In particular, soluble and solid-phase 
oligosaccarides find use as therapeutic agents with which to 
block or ameliorate antibody-mediated organ transplant 
rejection in cases involving incompatibility due to 
differences in the major blood group antigen systems of the 
organ donor and the recipient, including the Levis blood group 
system. Likewise, soluble oligosaccharides may find use as 
therapeutic agents that function by blocking attachment of 
bacterial, viral, or parasitic pathogens to glycocon jugate 
"receptors 1 ' found on the surface of the animal tissues that 
these pathogens invade. 

For example there is evidence that portions of the Lewis 
blood group oligosaccharide antigen (containing sub- terminal 
a (1,3) and a (1,4) fucose residues) serve as "receptors" for 
some forms of uropathogenic bacteria. Moreover, 
glycoconjugates, including subterminal a (1,3) and cr(l,4) 
fucose residues, have been implicated in modulating adhesive 
events between cells, like leukocyte-ELAM-1 interactions, and 
between cells and their environment during developmental and 
differentiation processes. These events include binding of 
spermatozoa to eggs, and the initial events that mediate 
attachment of fertilized ova to the uterine wall at the 
beginning of implantation. These observations suggest, for 
example, the possibility that contraceptive uses for 
(biologically "natural") oligosaccharide molecules might 
exist. Oligosaccharide molecules constructed by this enzyme 
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can disrupt leukocyte-ELAM interactions and thus function as 
anti-inflammatory agents. 

Currently, oligosaccharides containing sub terminal o(l r 3) 
and a (1,4) fucose residues are produced by chemical synthesis 
(a procedure that is inefficient and costly or both) or by 
isolation from natural sources (using costly and inefficient 
procedures that often require the processing of large 
quantities of animal or plant material, and the purification 
of the desired oligosaccharide from other contaminating 
oligosaccharides) . 

The invention described here provides a mechanism to 
synthesize abundant quantities of purified GDP-Fuc: [0-D- 
Gal (1,4/1,3)] -D-GlcNAc ( /Glc ) o ( 1 , 3 / 1 , 4 ) -Fucosy 1 trans f erase , 
Fuc-TIII. This can be used to construct an enzyme bioreactor 
(enzyme in solution or immobilized on a solid phase matrix, 
for example via the protein-A moiety fused to the catalytic 
domain of the enzyme (as described in Kukowska-Latallo et al . 
Genes Devel. . (1990) 4:1288-1303) capable of enzymatic 
synthesis of structures containing sub-terminal a (1,3) and 
a (1,4) fucose residues. 

This is more efficient than approaches involving chemical 
synthesis of structures containing sub-terminal a (1,3) and 
a (1,4) fucose residues or their purification from natural 
sources, for a variety of reasons* One, the only chemicals 
necessary are the enzyme substrates and co- factors; these are 
easily obtained or synthesized. Two, enzymatic synthesis of 
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such structures will produce only the desired product and the 
nucleotide diphosphate product of substrate hydrolysis. This 
latter chemical is found as the natural by-product of these 
reactions in animal cells, is relatively non-toxic, and may be 
easily separated from the oligosaccharide synthetic product. 
By contrast, chemical synthetic procedures typically generate 
numerous products of side reactions which must be removed, and 
which may be toxic as well* Similarly, purification of 
oligosaccharides from natural sources requires the removal of 

other contaminating oligosaccharides present in the natural 
material . 

Three, enzymatic catalysis is extraordinarily efficient; 
nearly or totally complete conversion of substrate to product 
might be achieved. By contrast, chemical synthesis of sub- 
terminal a (1,3) and a (1,4) fucose residues on oligosaccharides 
is a multi-step process; yields at each step may be much less 
than 100%, and the cumulative efficiency of current chemical 
synthesis procedures does not approach the efficiency possible 
with enzymatic synthesis. Similarly, purification of 
oligosaccharides with sub-terminal a (1,3) and a (1,4) fucose 
residues from natural materials can entail significant losses 
inherent to the purification procedures required to separate 
the desired oligosaccharide from contaminating, irrelevant 
oligosaccharides, with inefficient isolation of the desired 
oligosaccharide . 
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Although the GDP-Fuc: [0-D-Gal (1,4/1, 3) ]-D-GlcNAc(/Glc) 
a (1,3 /I, 4) -fucosyltransf erase encoded by the DNA sequence 
described here may be isolated from animal tissues for 
synthetic use, these purifications are themselves inefficient, 
primarily because the enzyme is typically present in very low 
abundance - 

This invention provides two mechanisms that provide for 
the abundant production of this enzyme. First, this may be 
done through the construction and selection of animal cells 
that produce relatively large quantities of the enzymes. 
Alternatively, this cloned nucleic acid sequence may be used 
with standard recombinant DNA technologies to produce large 
quantities of the fucosyltransf erase in yeasts or in 
prokaryotic hosts. Furthermore, the sequence encoding this 
enzyme may be modified via standard molecular cloning schemes 
or mutagenesis to yield a recombinant fucosyltransf erase with 
novel properties that make it more desirable than the 
wild-type enzyme. 

For example, modifications can be made to the enzyme that 
make it more stable, or more suitable for immobilization in a 
bioreactor. 

(iii.) Isolation of reagents suitable for producing 
recombinant GDP-Fuc: [0-D-Gal(l, 4/1, 3) ]-D-GlcNac(/Glc) 
a (1,3/1, 4) -fucosyltransf erase to be used directly as a 
research reagent, or to be used to generate antibodies against 
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the GDP-Fuc: [/3-r Gal(l,4/1 # 3) ]-D-GlcNAc(/Glc) ot(l,3/l,4)- 
fucosyltransf erase, for research applications. 

This invention provides two mechanisms for producing 
large quantities of this enzyme (see ii. above - i.e., 
specially constructed animal cells, or via natural or 
synthetic genes encoding these enzymes) which may be used as a 
research tool with which to study the structures and functions 
of oligosaccharides and glycoproteins. Likewise, the enzyme 
produced by this method, or the nucleic acid sequence and 
derived protein sequence provided by this method, may be used 
to generate antibodies to this enzyme (via immunization with 
synthetic peptides whose sequences are derived from the cloned 
gene(s) or cDNA(s) , or by immunization with the recombinant 
enzyme itself) . These antibodies might also be used as 
research reagents to study the biosynthesis and processing of 
these enzymes, and can be used as an aid in their purification 
for all the uses described in this text. 

(iv.) Antibodies to glycosyltransf erases as diagnostic 



Aberrant expression of GDP-Fuc: (0-D-Gal(l, 4/1,3) ]- 
D-GlcNac(/Glc) a (1, 3/1,4) -fucosyltransf erase has been 
associated with malignancy in humans, suggesting that this 
enzyme might serve as a tumor marker for early detection of 
malignancy involving a number of human tissues. Enzyme tumor 
markers have typically been assayed in body fluids by activity 
assays, which may be subject to non-specificity due to 
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competing or similar glycosyltransf erase activity. These 
assays may also be insensitive since it is possible that 
inactive enzymes might be useful as tumor markers but would 
not be detected by enzyme activity assays. 

This invention provides a mechanism for generating 
antibodies to this enzyme (monoclonal and polyclonal 
antibodies against synthetic peptides constructed from 
information derived from cloned OKA sequence encoding GDP-Fuc: 
0-D-Gal ( 1 , 4/ 1 , 3 ) ] -D-GlcNac ( /Glc) 

a ( 1, 3 / l, 4 )-f ucosy Itransf erase, or against the recombinant 
enzyme produced by eukaryotic or prokaryotic hosts) . 
Antibodies specific for this GDP-Fuc: [0-D-Gal (1,4/1, 3) ]- 
D-GlcNac(/Glc) at ( 1 , 3 / 1 , 4 ) -f ucosy Itransf erase produced could be 
used to detect and quant itate this glycosyltransf erase in body 
fluids, with specificity and sensitivity exceeding enzyme 
activity assays, and with the possibility of serving as a 
tumor marker for early detection of malignancy. 

(v.) Recombinant enzyme for use in screening natural and 
synthetic compounds for f ucosy Itransf erase inhibitors or 



A number of studies have noted an association between 
increased numbers of cell surface sub-terminal a (1,3) and a 
(1,4) fucose residues on oligosaccharides of a cell and the 
ability of that cell to metastasize in a malignant fashion. 
If there is a causal relationship, then drugs that inhibit the 
enzyme encoded by the sequence in this text could be active as 



WO 97/09421 



# 




/US96/13816 



-73- 



anti-tumor agents. Likewise, numerous recent studies have 
implicated sialylated and neutral oligosaccharides containing 
subterminal a (1,3) and a (1,4) fucose linkages in mediating 
adhesion of leukocytes to the selectin adhesion molecules 
(ELAM-1; GMP-140; Mel 14 / LAM- 1) during inflammation. These 
studies suggest that molecules capable of preventing synthesis 



of a (1,3) and a (1,4) fucose linkages on leukocytes may thus 
function to diminish or even eliminate the ability of 
leukocytes to synthesize and display subterminal a (1,3) and 
a (1,4) fucose linkages, and would thus represent anti- 
inflammatory pharmaceutical agents. The reagents described in 
this text are useful for screening to isolate compounds or 
identify compounds that exhibit antifucosyltransf erase 
activity, since the cloned sequence may be used with standard 
techniques to produce relatively large amounts of pure 
fucosyltransf erase. This further aids in screening since the 
effects of potential inhibitors will be tested on a pure 
enzyme, without the confounding effects that may occur in 
whole cell extracts or with partially purified enzyme. 

(vi.) Engineering of glycosyl transferase substrate 
specificity to generate novel glycocon jugate structures on 
secreted or cell-associated glycoconjugates. 

This invention provides a reagent a) cloned GDP-Fuc:[j8- 
-D-Gal (1,4/1,3)] -D-GlcNac ( /Glc) a ( 1 , 3 / 1 , 4 ) -Fucosyltransf erase 
cDNA) and the genetic selection method used to isolate it, 
that, when used with appropriate mutagenesis schemes, may 
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allow the generation of mutant GDP-Fuc: [0-D-Gal(l, 4/1,3) ]-D- 
GlcNac(/61c) a (1, 3/1, 4) -Fucosyl transferases that generate 
glycosidic linkages different from that generated by the 
wild-type enzyme. These novel linkages may or may not be 
naturally occurring , and might find utility as moieties that 
enhance bioactivity of the molecules to which they are 
attached . 

Alternatively, mutagenesis and selection approaches may 
be used to generate mutant GDP-Fuc: [j8-D-Gal (1,4/1, 3) ]-D- 
GlcNac(/Glc) a (1,3/1, 4 ) -Fucosyltransf erases that act in a 
dominant negative fashion. The dominant negative mutants so 
generated can be used to inactivate endogenous 
glycosyltransf erase activities when the product (s) of such an 
enzyme are not desired. Mutant GDP-Fuc: [0-D- 
Gal (1,4/1,3)] -D-GlcNac ( /Glc) a ( 1 , 3 / 1 , 4 ) -Fucosyl transferases 
can also be generated, for example, that function as 
fucosidases that hydro lyze various sugar linkages (fucose, 
mannose, or others) from oligosaccharides in vitro and in 
vivo . 

(vii.) Geno typing individuals for the Lewis locus. 

DNA sequence polymorphisms within or linked to the gene 
corresponding to this cloned cDNA may be used to genotype 
individuals for the Lewis locus. This can find utility with 
respect to organ transplantation procedures, or as a measure 
of susceptibility to infections caused by pathogens that may 
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use blood group structures as receptors for invasion (as in 
urinary tract infections , for example) , 

SEQ ID NO: 3 encodes a mouse UDP~Gal:£«D~Gal(l,4)~D~GlcNAc 
ffi(l,3)-galactosyltransferase. SSQ ID N0s5 encodes a human 
GDP-Fuc s act o side a ( l , 2 ) -f ucosyltransf erase . The uses 



for each of these proteins are generally the same as those 
discussed herein for the enzyme of SEQ ID NOsl. 

Specific application of the enzyme encoded by SSQ ID W0s5 
include enzymatic fucosylation of chain-terminating galactose 
residues on lactoseamine or neolacto type £°D~galactoside to 
ce-2-L-fucose residues. Such modification can be performed jba 
vitro using the purified cz (1,2) FT, or its derivatives and its 



substrate GDP-fucose, using asialoglycans terminating with j3~ 
D-galactoside residues. Such asialoglycans exist naturally, 
and can also be constructed from glycans with terminal 
galactose moieties substituted with sialic acid by in vitro 
digestion with neuraminidase. Likewise, such fucosylation can 
be expected to occur when glycans are expressed in mammalian 
cells that have been transfected with the a(l,2)FT cDNA or 
gene segment. Such a (1,2) fucosylated glycans may have 
increased solubility properties, may have prolonged plasma 
half lifes (by virtue of the fact that the terminal galactose 
residues that normally mediate glycoprotein clearance by the 
asialoglycoprotein receptor of the liver are now covered up by 
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fucose residues), differentiate from natural glyco forms, and 
may enhance bioactivity. 

Molecular mechanisms used by cells to regulate the 
precise tissue-specific and developmental expression patterns 
of oligosaccharide structures are poorly understood. Such 
patterns however are probably determined largely by the 
coordinate regulation of expression of cognate 
glycosyltransf erases. Since many of these enzymes recognize 
identical nucleotide sugar substrates or oligosaccharide 
acceptor substrates, it can be expected that they exhibit 
substantial primary protein and nucleic acid sequence 
similarities that would facilitate isolation of related 
glycosyltransf erase genes by cross hybridization strategies* 

Molecular cloning efforts by the inventor have allowed 
the isolation of several cloned glycosyltransf erase cDNAs 
discussed above. Comparisons of the primary sequences of 
these enzymes reveals that they maintain virtually identical 
predicted structural topologies. 

With the exception of one pair of distinct 
glycosyltransf erases, however, there appear to be no 
substantial primary sequence similarities between these 
enzymes, even though many of them exhibit nucleotide sugar 
substrate or oligosaccharide acceptor substrate requirements 
that are virtually identical. The exceptional pair, a murine 
al,3 galactosyltransf erase sequence, or its human pseudogene 
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homologue, and a human al,3N-acetylgalactosaminide transferase 
share substantial primary protein and nucleic acid sequence 
similarity, even though these enzymes use different nucleotide 
sugar substrates and exhibit distinct oligosaccharide acceptor 
substrate requirements. 

Taken together, these observations indicate that some 
glycosyltransf erases may be structurally related, but such 
relationships cannot necessarily be predicted from knowledge 
of nucleotide sugar or oligosaccharide acceptor substrate 
requirements . 

As noted above, the gene transfer procedure of the 
invention has been used by the inventor to isolate a cloned 
cDNA that encodes the human Levis blood group 
fucosyltransf erase. This enzyme is an exceptional 
glycosyltransf erase in that it can catalyze two distinct 
transglycosylation reactions. Sequence comparisons of this 
enzyme with the blood group H a(l, 2) fucosyltransf erase 
indicates that these two fucosyltransf erases maintain distinct 
primary sequences despite the fact that they use the identical 
nucleotide sugar substrate GDP-fucose, and can each utilize 
oligosaccharide acceptor molecules that terminate with 
unsubstituted type I or type II disaccharide moieties. 

Biochemical and genetic data indicate that the human 
genome contains one or more structural genes that encode 
fucosyltransf erases competent to construct surface localized 
SSEA-l determinants. These enzymes are thought to be 
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polypeptides distinct from the Levis fucosyltransf erase 
because they exhibit different acceptor substrate 
specificities and differential sensitivities to divalent 
cation and N-ethylmaleimide inactivation. Moreover, their 
expression is determined by loci distinct from the Lewis blood 
group fucosyltransf erase locus, and they display tissue 
specific patterns that are distinct from the Levis locus 
patterns . 

Because these enzymes exhibit properties that are very 
similar to the Levis blood group fucosyltransf erase, the 
inventor recognized that it vas possible that these enzymes 
and their corresponding genes might be sufficiently related at 
the primary sequence level to be able to isolate them by 
crosshybridization approaches. In another embodiment, the 
invention therefore provides a method for isolating a gene 
encoding a glycosyltransf erase by cross-hybridization. The 
cross-hybridization techniques vhich can be used in accordance 
with the invention are generally known. See, e.g. p Lauer et 
al. Cell (1980) 20:119-130, Fritsch et al . Cell (1980) 
19:959-972, Havnes et al , J. Biol . Chem. (1980) 255:6355-6367, 
and Proudfoot et al . Proc. Nat. Acad. Sci. (USA) (1979) 
76:5425-5439, all of which are hereby incorporated by 
reference. 

As noted above, oligosaccharides constructed by animal 
cells are remarkable for their structural diversity. This 
diversity is not random but rather consists of specific sets 
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of oligosaccharide structures that exhibit precise tissue- 
specific and developmental expression patterns. Molecular 
mechanisms used by cells to regulate these expression patterns 
are poorly understood. It can be expected, however, that such 
patterns are determined largely by the coordinate regulation 
of expression of the glycosyltransf erases that determine these 
patterns. Recent molecular cloning efforts have allowed the 
isolation of several cloned glycosyltransf erase cDNAs . 
Comparisons of the primary sequences of these enzymes have 
revealed that they maintain virtually identical predicted 
structural topologies. With the exception of one pair of 
distinct glycosyltransf erases, however, there appear to be no 
substantial primary sequence similarities between these 
enzymes, even though many of them exhibit nucleotide sugar 
substrate or oligosaccharide acceptor substrate requirements 
that are virtually identical. The exceptional pair, a murine 
al, 3galactosyltransf erase sequence, or its human pseudogene 



share substantial primary protein and nucleic acid sequence 
similarity, even though these enzymes use different nucleotide 
sugar substrates and exhibit distinct oligosaccharide acceptor 
substrate requirements. Taken together, these observations 
suggest that some glycosyltransf erases may be structurally 
related, but such relationships cannot necessarily be 
predicted from knowledge of nucleotide sugar or 
oligosaccharide acceptor substrate requirements. The inventor 



homologue, and a human al,3N-acetylgalactosaminide transferase 
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has recently used a mammalian gene transfer procedure to 
isolate a cloned cDNA that encodes the human Lewis blood group 
fucosyltransf erase (Kukowska-Latallo et al, Genes Devel . , 
4:1288-1303, 1990). This enzyme is an exceptional 
glycosyltransf erase in that it can catalyze two distinct 
transglycosylation reactions. Sequence comparisons of this 
enzyme with the blood group H ot(l, 2) fucosyltransf erase (Lars en 
et al f Proc. Natl. Acad. Sci. USA . 87:6674-6678, 1990) 
indicates that these two fucosyltransf erases maintain distinct 
primary sequences despite the fact that they use the identical 
nucleotide sugar substrate GDP-fucose, and can each utilize 
oligosaccharide acceptor molecules that terminate with 
unsubstituted type I or type II disaccharide moieties 



(Kukowska-Latallo et al, Genes Devel. . 4:1288-1303, 1990; 
Larsen et al, Proc. Natl. Acad. Sci. USA , 87:6674-6678, 1990). 
Biochemical and genetic data indicate that the human genome 
contains one or more structural genes that encode 



SSEA-1 determinants (Kukowska-Latallo et al, Genes Devel. , 
4:1288-1303, 1990; Potvin et al, J. Biol. Chem. , 265:1615- 
1622, 1990). These enzymes are thought to be polypeptides 
distinct from the Lewis fucosyltransf erase because they 
exhibit different acceptor substrate specificities and 
differential sensitivities to divalent cation and N- 
ethylmaleimide inactivation (Potvin et al, J. Biol. Chem.. 
265:1615-1622, 1990). Moreover, their expression is 



fucosyltransf erases competent to construct surface localized 
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determined by loci distinct from the Levis blood group 
fucosyltransf erase locus, and they display tissue specific 
patterns that are distinct from the Lewis locus patterns 
(Watkins, ftriv. ffiti m. Genet. . 10:1-116, 1980). Because these 
enzymes exhibit properties that are very similar to the Lewis 
blood group fucosyltransf erase, the inventor considered it 
possible that enzymes and their corresponding genes might be 
sufficiently related at the primary sequence level to be able 
to isolate them by cross-hybridization approaches. The 
inventor has now achieved the isolation of several such cross- 
hybridizing human genes, an analysis of their structures, 

4 

their expression in COS-l cells after DEAE-dextran-mediated 
transfection, and analysis of their acceptor substrate 
properties. 

Thus, in another embodiment, the invention provides DNA 
sequence SEQ ID NO: 7 (set forth in Figure 4) that encodes a 
protein sequence capable of functioning as a GDP-Fuc: [/3-D- 
Gal (1,4)] -D-GlcNAc a ( 1 , 3 ) -Fucosyltransf erase , Fuc-TIV. This 
enzyme, when expressed by the cloned DNA sequence SEQ ID NO: 7 
functions within mammalian cells to generate de novo 
expression of specific cell surface glycocon jugate structures 
of those cells. 

Present DNA sequence SEQ ID NO: 9 (set forth in Figure 5 
wherein it is indentif ied as pFT-3 DNA) is comprised within 
DNA sequence SEQ ID NO: 7. Namely, sequence SEQ ID NO: 9 is 
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found beginning at nucleotide position 1942 (the codon at 
nucleotide position 1942-1944) in sequence SEQ ID NO: 7. 

The DNA sequence and corresponding peptide provided in 
this embodiment of the invention must correspond at least to 
the segment from nucleotide position 2089 to 3159, preferably 



positions 1942 to 3156, of Sequence SEQ ID N0^7 set forth in 
Figure 4. These DNA sequences, can have further DNA sequence 
attached optionally to each end. These pendent DNA sequences 
can be of any length and up to a length corresponding to that 
set forth in Figure 4. 

In a preferred embodiment, this embodiment of the 
invention provides DNA sequences, and their corresponding 
proteins, corresponding at least to the sequence between 



nucleotide positions 2089 to 3159, preferably positions 1942 
to 3156, SEQ ID NO: 7 and having attached to each end, 
optionally, further DNA sequences corresponding to those set 
forth in Figure 4. In this case, the pendent DNA sequences 
and corresponding proteins, can be of any length and up to the 
length set forth in Figure 4. 

These glycocon jugate structures, constructed in part by 
this enzyme, are recognized by an antibody against the stage 
specific embryonic antigen I (SSEA-1 or Lewis x; structure 
GaljS (1, 4) [Fuc or(l,3) ]GlcNAc) , and by an antibody against the 
VIM-2 determinant NeuAca(2,3)Gal0(l,4)GlcNAcj3- 
(l,3)Galj3(l,4) [Fuca(l,3) ]GlcNAc. This enzyme when expressed 
by DNA SEQ ID NO: 7 functions in the enzymatic manner implied 
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in its name, when assayed in extracts prepared from cells that 
express the DNA sequence, as illustrated in Figure 8 and Table 
2. 

The oligosaccharide products of this enzyme represents 
fucose linked in alpha 1,3 configuration to the GlcNac residue 
of a "type II", lactos [ amine ] acceptor. Throughout the 
remainder of this disclosure, these products will be referred 
to as subterminal a (1,3) fucose residues. 

The isolation of three specific such cross-hybridizing 
human genes (SEQ ID NO: 7, SEQ ID NO: 10, SEQ ID NO: 13), an 
analysis of their structure, their expression in COS-l cells 
after DEAE-dextran-mediated transfection, and analysis of 
their acceptor substrate properties is described in the 
examples below, (Examples IV, V, and VI) , and summarized in 
Figures 8 and Table 2. SEQ ID NO: 10 (Fuc-TV) encodes a 
specific protein sequence capable of functioning as a 
GDP-Fuc: [0-D-Gal(l, 4) ]-D-GlcNAc ar(l, 3) fucosyltransf erase. 
This enzyme, when expressed by the cloned DNA sequence 
described, functions within mammalian cells to generate de 
novo expression of specific cell surface glycoconjugate 
structures on those cells. These structures are recognized by 
an antibody against the stage specific embryonic antigen I 
(SSEA-1 or Lewis x; structure Gal£(l,4) [Fuca(l, 3) ]GlcNAc) , and 
by an antibody against the sialyl-Lewis x determinant 
NeuAca(2,3)Galj8(l,4) [Fuca ( 1 , 3 ) ]GlcNAc. This enzyme, when 
expressed by the cloned DNA sequence described, also functions 
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in the enzymatic manner implied in its name, when assayed in 
extracts prepared from cells that express the DNA sequence. 
The oligosaccharide products of this enzyme represent fucose 
linked in alpha 1,3 configuration to the GlcNac residue of a 
"type II' lactosf amine] acceptor. Throughout the remainder of 
this disclosure, these products will be referred to as 
subterminal a (1,3) fucose residues. The location of the 
catalytic domain of this enzyme has been shown experimentally 
to encompass amino acids 43 to 374 of SEQ: ID NO:ii. 

The DNA and encoded protein of SEQ ID NO: 13 and SEQ ID 
NO: 14 (Fuc-TVI) may be used as follows: 

i. Construction of animal cell lines with specific 
capabilities with respect to post- trans la tional modification 
of the oligosaccharides on cell-surface, intracellular, or 
secreted proteins or lipids by sub-terminal a- (1,3) fucose 
residues that represent the products of this enzyme (for the 
production of diagnostics and therapeutics by the 
biotechnology industry) . 

Specifically, the cloned DNA sequence described here may 
be introduced by standard technologies into a mammalian cell 
line that does not normally express the cognate enzyme or its 
product (sub- terminal a (1,3) fucose residues on 
oligosaccharides) , and transcribed in that cell in the "sense ' 
direction, to yield a cell line capable of expressing 
sub-terminal a (1,3) fucose residues residues on 
oligosaccharides on cell-surface, intracellular, or secreted 
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proteins or lipids. Alternatively, this cloned DNA sequence 
may be introduced by standard technologies into a mammalian 
cell line that does express the cognate enzyme and its product 
(subterminal a (1,3) fucose residues), and transcribed in that 
cell in the "antisense' direction, to yield a cell line 
incapable of expressing sub-terminal a (1,3) and fucose 
residues on cell-surface, intracellular, or secreted proteins 
or lipids. Alternatively, the endogenous 
GDP-Fuc : [ 0-D-Gal (1,4)] -D-GlcNAc a ( 1 , 3 ) -Fucosy ltransf erase 
gene(s) , in a mammalian cell expressing the cognate enzyme (s), 
might be inactivated with the DNA sequence described here by 
homologous recombination techniques, or by "antisense' 
oligonucleotide approaches based upon the DNA sequence 
described herein, or by dominant negative mutant 
fucosyltransf erase sequences that inactivate endogenous 
GDP-Fuc : [ 0-D-Gal ( 1 , 4 ) ] -D-GlcNAc a ( 1 , 3 ) -Fucosy ltransf erase ( s) 
and that may be derived via mutagenesis and genetic selection 
schemes , in conjunction with the sequence information in this 
Disclosure. 

This method could be used to construct animal cell lines 
that will be suitable host cells for the production of 
diagnostic or therapeutic materials whose usefulness or 
efficacy depends upon the specific post-translational 
modification determined by this cloned DNA sequence and its 
cognate enzyme. For example, it is known that that the 
biological effectiveness of many therapeutic proteins or 
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peptides, recombinant or otherwise, may depend critically upon 
the oligosaccharide structure ( s ) that are covalently attached 
to them. The structure of these oligosaccharides is primarily 
a function of the number and kind of glycosyltransf erase 
enzymes that are found in the cell used to produce these 
therapeutic products. Animal cells and yeasts are competent 
to perform these glycosylation reactions; however, not all 
glycosyltransf erase enzymes are produced by every animal cell 
or yeast, and therefore, some oligosaccharide structures 
(including sub-terminal a (1,3) fucose residues generated by 
the enzyme encoded by the DNA sequence described here) are not 
produced by them. The converse is also true, namely, that 
producing cells may express a glycosyltransf erase analogous 
to, or identical to, the GDP-Fuc: [0-D-Gal(l, 4) ]-D-GlcNAc 
a(l,3) -Fucosyl transferase encoded by the DNA sequence 
described here. It is likely that sub-terminal a (1,3) fucose 
residues may alter the bioactivity (for better or for worse) 
of natural or recombinant therapeutic or diagnostic agents 
(glycoproteins or glycolipids) produced by mammalian or other 
eukaryotic hosts. Eukaryotic host cells that the 
biotechnology industry uses to produce these recombinant 
agents may be altered with the DNA sequence information and 
related information described in this invention, to add 
sub-terminal a (1,3) fucose residues to the oligosaccharides on 
recombinant product by expressing all or part of the cloned 
sequences described here in the desired host. Alternatively, 
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sub-terminal a (1,3) fucose residues may be eliminated from the 
product produced in these host cells by the use of trans fee ted 
"anti-sense" vector constructs, recombination-based gene 
inactivation, 'anti-sense' oligonucleotide approaches, or 
dominant negative mutant fucosyltransf erases, outlined above. 

The old 'methods' used for this process include an 
empirical approach to identify a cell line that does or does 
not express this particular enzyme or an enzyme that functions 
in a similar or identical manner, for the production of the 
appropriately modified recombinant or natural product. This 
is not always optimal since cell lines with this particular 
post-translation modification capabilities may not exist 
naturally, or may not be especially suited to high level 
production of an appropriately modified product. 
Alternatively, unwanted sub-terminal a (1,3) fucose residues 
present on a therapeutic material produced by an empirically 
identified animal cell line must be removed chemically or 
enzymatically, a process that may be costly or inefficient. 
The advantages of using the cloned, functional DNA sequence 
described here in conjunction with the technologies outlined 
above, relative to these older methods, include the ability to 
construct lines that specifically lack the capability to 
generate sub-terminal a (1,3) fucose residues on the 
oligosaccharides of glycoproteins and glycolipids; properly 
constructed, these cell lines will eliminate any need for 

r 

\ 

chemical or enzymatic treatment of a therapeutic or diagnostic 
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material to remove unwanted sub-terminal a(l,3)fucose 
residues. Moreover, in the event that sub- terminal 
a(l,3)fucose residues residues are found to be desirable for a 
particular diagnostic or therapeutic product produced by 
animal cells, cell lines may be engineered with the cloned DNA 
sequence described here to generate these residues. 

ii. Isolation of reagents suitable for efficient 
enzymatic synthesis and production of oligosaccharides (in 
enzyme reactors, for example) . 

Oligosaccharides may have therapeutic utility as 
immunomodulatory reagents in the field of organ 
transplantation. In particular, soluble and solid-phase 
oligosaccharides may find use as therapeutic agents with which 
to block or ameliorate antibody-mediated organ transplant 
rejection in cases involving incompatibility due to 
differences in the major blood group antigen systems of the 
organ donor and the recipient, including the Lews blood group 
system. Likewise, soluble oligosaccharides may find use as 
therapeutic agents that function by blocking attachment of 
bacterial, viral, or parasitic pathogens to glycocon jugate 
'receptors" found on the surface of the animal tissues that 
these pathogens invade. For example there is evidence that 
portions of the Lewis blood group oligosaccharide antigen 
(containing sub-terminal oe(l,3) fucose residues) serve as 
"receptors" for some forms of uropathogenic bacteria. 
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Moreover , glycocon jugates , including sub-terminal a (1,3) fucose 
residues, have been implicated in modulating adhesive events 
between cells, and between cells and their environment during 
developmental and differentiation processes. These events 
included binding of spermatozoa to eggs, and the initial 
events that mediate attachment of fertilized ova to the 
uterine wall at the beginning of implanatation. These 
observations suggest, for example, the possibility that 
contraceptive uses for (biologically "natural") 
oligosaccharide molecules might exist. In addition, specific 
glycocon jugates containing sub- terminal a (1,3) fucose residues 
have been implicated as ligands for the LECCAM/ Select in family 
of adhesion molecules , that play important roles in mediating 
adhesion between cells of the immune system, and some tumor 
cells, and the surfaces of the endothelial cells that line the 
vascular tree. Thus, the cloned fucosyltransf erase sequence 
described here may be used to construct oligosaccharide- type 
molecules, with pharmaceutical properties possessing 
ant i- inflammatory and anti-tumor metastatic functions. 

Currently, oligosaccharides containing sub-terminal 
a (1,3) fucose residues are produced by chemical synthesis (a 
procedure that is inefficient and costly) or by isolation from 
natural sources (using costly and inefficient procedures that 
often require the processing of large quantities of animal or 
plant material, and the purification of the desired 
oligosaccharide from other contaminating oligosaccharides) . 
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The invention described here provides a mechanism to 
synthesize abundant quantities of purified 
GDP-Fuc: [0-D-Gal(l,4) ]-D-GlcNAc a (1, 3) -Fucosyltransf erase. 
This could be used to construct an enzyme bioreactor (enzyme 
in solution or immobilized on a solid phase matrix, for 
example via the protein-A moiety fused to the catalytic domain 
of the enzyme, as described in previous manuscripts published 
by the Inventor John Love) capable of enzymatic synthesis of 
structures containing sub-terminal a (1,3) fucose residues. 
This may be more efficient than approaches involving chemical 
synthesis of structures containing sub-terminal a (1,3) fucose 
residues or their purification from natural sources, for a 
variety of reasons. One, the only chemicals necessary would 
be the enzyme substrates; these are easily obtained or 
synthesized. Two, enzymatic synthesis of such structures will 
produce only the desired product and the nucleotide 
diphosphate product of substrate hydrolysis. This latter 
chemical is found as the natural byproducts of these reactions 
in animal cells, is relatively non-toxic, and may be easily 
separated from the oligosaccharide synthetic product. By 
contrast, chemical synthetic procedures typically generate 
numerous products of side reactions which must be removed, and 
which may be toxic as well. Similarly, purification of 
oligosaccharides from natural sources requires the removal of 
other contaminating oligosaccharides present in the natural 
material. Three, enzymatic catalysis is extraordinarily 
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efficient; essentially complete conversion of substrate to . 
product night be achieved. By contrast, chemical synthesis of 
sub-terminal a (1,3) fucose residues on oligosaccharides is a 
multi-step process; yields at each step may be much less than 
100%, and the cumulative efficiency of current chemical 
synthesis procedures does not approach the efficiency possible 
with enzymatic synthesis. Similarly, purification of 
oligosaccharides with subterminal a (1,3) fucose residues from 
natural materials can entail significant losses inherent to 
the purification procedures required to separate the desired 
oligosaccharide from contaminating, irrelevant 
oligosaccharides, with inefficient isolation of the desired 
oligosaccharide . Although the GDP-Fuc : [ jS-D-Gal (1,4)] -D-GlcNAc 
a ( 1 , 3 ) -Fucosy ltransf erase encoded by the DNA sequence 
described here may be partially purified from animal tissues 
for synthetic use, these purifications are themselves 
inefficient, primarily because the enzyme is typically present 
in very low abundance* This invention provides two mechanisms 
that may provide for the abundant production of this enzyme. 
First, this may be done through the construction and selection 
of animal cells that produce relatively large quantities of 
the enzymes. Alternatively, this cloned nucleic acid sequence 
may then be used with standard recombinant DNA technologies to 
produce large quantities of glycosy ltransf erases in yeasts or 
in prokaryotic hosts. Furthermore, the sequence encoding this 
enzyme may be modified via standard molecular cloning schemes 
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or mutagenesis to yield a recombinant fucosyltransferase with 
novel properties that make it more desireable than the 
wild-type enzyme. For example, the modifications might be 
made to the enzyme that make it more stable, or more suitable 
for immobilization in a bioreactor. 

iii. Isolation of reagents suitable for producing 
recombinant GDP-Fuc : [ 0-D-Gal (1,4)] -D-GlcNAc 
a(l, 3) -Fucosyltransferase to be used directly as a research 
reagent, or to be used to generate antibodies against the GDP- 
Fuc : [ 0-D-Gal (1,4)] -D-GlcNAc a ( 1 , 3 ) -Fucosyltransferase , for 
research applications. 

This invention provides two mechanisms for producing 
large quantities of this enzyme (see ii. above - i.e. 
specially constructed animal cells, or via natural or 
synthetic genes encoding these enzymes) which may be used as a 
research tool with which to study the structures and functions 
of oligosaccharides and glycoproteins. Likewise, the enzyme 
produced by this method, or the nucleic acid sequence and 
derived protein sequence provided by this method, may be used 
to generate antibodies to this enzyme (via synthetic 
peptides) . These antibodies might also be used as research 
reagents to study the biosynthesis and processing of these 
enzymes, and might be used as an aid in their purification for 
all the uses described in this disclosure. 
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iv. Antibodies to glycosyltransf erases as diagnostic 
reagents . 

Aberrant expression of GDP-Fuc: [0-D-Gal(l, 4) ]-D-GlcNAc 
a(l, 3) -Fucosyltransf erase has been associated with malignancy 
in humans, suggesting that this enzyme might serve as a tumor 
marker for early detection of malignancy involving a number of 
human tissues. Enzyme tumor markers have typically been 
assayed in body fluids by activity assays, which may be 
subject to non-specificity due to competing 
glycosyltransf erase activity. These assays may also be 
insensitive since it is possible that inactive enzymes might 
be useful as tumor markers but would not be detected by enzyme 
activity assays. This invention provides a mechanism for 
generating antibodies to this enzyme (monoclonal and 
polyclonal antibodies against synthetic peptides constructed 
from information derived from cloned DNA sequence encoding 
GDP-Fuc : [0-D-Gal (1,4)] -D-GlcNAc a ( 1 , 3 ) -Fucosyltransf erase , or 
against the recombinant enzyme produced by eukaryotic or 
prokaryotic hosts) . Antibodies specific for this 
GDP-Fuc: [0-D-Gal (1,4) ]-D-GlcNAc a (1, 3) -Fucosyltransf erase so 
produced could be used to detect and quantitate this 
glycosyltransf erase in body fluids, with specificity and 
sensitivity exceeding enzyme activity assays, and with the 
possibility of serving as a tumor marker for early detection 

of malignancy. 
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v. Recombinant enzyme for use in screening natural and 
synthetic compounds for fucosyltransf erase inhibitors or 
inactive tors. 

A number of studies have noted an association between 
increased numbers of cell surface sub-terminal a(l,3) fucose 
residues on oligosaccharides of a cell and the ability of that 
cell to metastasize in a malignant fashion. If there is a 
causal relationship here, then it may be possible that drugs 
that inhibit the enzyme encoded by the sequence in this 
disclosure might be active as anti-tumor agents. Likewise, 
numerous recent studies have implicated sialylated and neutral 
oligosaccharides containing subterminal a (1,3) and a (1,4) 
fucose linkages in mediating adhesion of leukocytes to the 
select in adhesion molecules (ELAM-1; GMP-140; Mel 14 / LAM- 1) 
during inflammation. These studies suggest that molecules 
capable of preventing synthesis of a (1,3) and a(l f 4) fucose 
linkages on leukocytes may thus function to diminish or even 
eliminate the ability of leukocytes to synthesize and display 
subterminal a (1,3) and a (1,4) fucose linkages, and would thus 
represent anti-inflammatory pharmaceutical agents. The 
reagents described in this disclosure may prove useful for 
screening to isolate or identify compounds that exhibit 
anti-fucosyltransf erase activity, since the cloned sequence 
may be used with standard techniques to produce relatively 
large amounts of pure fucosyltransf erase. This will aid in 
screening since the effects of potential inhibitors will be 
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tested on a pure enzyme, without the confounding effects that 
may occur in whole cell extracts or with partially purified 
enzyme. 

vi. Engineering of glycosyl transferase substrate 
specificity to generate novel glycocon jugate structures on 
secreted or cell-associated glycocon jugates. 

This invention provides a reagent (a cloned 
GDP-Fuc: [jff-D-Gal(l,4) ]-D-GlcNAc a ( 1 , 3 ) -Fucosyltransf erase gene 
segment) , that, when used with appropriate mutagenesis and 
genetic selection schemes, may allow the generation of mutant 
GDP-Fuc : [0-D-Gal (1,4)] -D-GlcNAc a ( 1 , 3 ) -Fucosyltransf erases 
that generate glycosidic linkages different from that 
generated by the wild-type enzyme. These novel linkages may 
or may not be naturally occur ing, and could find utility as 
moieties that enhance the bioactivity of the molecules to 
which they are attached. Directed mutagenesis procedure may 
also be considered since this enzyme maintains primary 
sequence similarity to other a (1, 3) -Fucosyltransf erases, yet 
exhibits a distinct set of acceptor substrate utilization 
properties. Alternatively, mutagenesis and selection 
approaches may be used to generate mutant 

GDP-Fuc: [0-D-Gal(l,4) ]-D-GlcNAc a (1,3) -Fucosyltransf erases 
that act in a dominant negative fashion. The dominant 
negative mutants so generated might be used to inactivate 
endogenous glycosyltransf erase activities when the product (s) 
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of such an enzyme are not desired. Mutant 

GDP-Fuc : [ /3-D -Gal (1,4)] -D-GlcNAc a ( 1 , 3 ) -Fucosy ltransf erases 

might also be generated, for example, that function as 

fucosidases that hydrolyze various sugar linkages (fucose, 

mannose, or others) from oligosaccharides in vitro and in 

vivo. 

vii. Gen o typing individuals at this fucosy ltransf erase 
locus • 

Absence of a fucosy ltransf erase similar or identical to 
the one encoded by the DNA sequence detailed here has been 
found in several families. Should such absence be associated 
with a detrimental phenotype, DNA sequence polymorphisms 
within or linked to the gene corresponding to this cloned gene 
segment may be used to genotype individuals at this locus, for 
the purpose of genetic counseling. Likewise, the molecular 
basis for any such detrimental phenotypes might be elucidated 
via the study of the gene segment described here, should it be 
causally-related to such phenotypes. 

In another embodiment, the present invetnion provides 
novel chimeric fucosy ltransf erases. 

Five different human a (1,3) fucosy ltransf erase (cr(l,3)Fuc- 
T) genes have been cloned. Their corresponding enzymes 
catalyze the formation of various a (1,3)- and o(l,4)- 
fucosylated cell surface oligosaccharides, including several 
that mediate leukocyte-endothelial cell adhesion during 
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inflammation. Inhibitors of such enzymes are predicted to 
operate as anti-inflammatory agents; in principle, the 
isolation or design of such agents may be facilitated by 
identifying peptide segment (s) within these enzymes that 
interact with their oligosaccharide acceptor substrates. 
Little is known, however, about the structural features of 
or(l, 3) Fuc-Ts that dictate acceptor substrate specificity. To 
begin to address this problem, we have created and 
functionally characterized a series of twenty-one recombinant 
a(l,3)Fuc-T chimeras derived from three human a (1, 3) Fuc-Ts 
(Fuc-TIII, Fuc-TV, and Fuc-TVI) that maintain shared and 
distinct polypeptide domains and that exhibit common as well 
as idiosyncratic acceptor substrate specificities. The in 
vivo acceptor substrate specificities of these a(l,3)Fuc-T 
chimeras, and of their wild type progenitors, were determined 
by characterizing the cell surface glycosylation phenotype 
determined by these enzymes, after expressing them in a 
mammalian cell line informative for the synthesis of four 
distinct a (1,3)- and a ( 1 , 4 ) -f ucosy lated cell surface 
oligosaccharides (Lewis x, sialyl Lewis x, Lewis a, and sialyl 
Lewis a) • Our results indicate that as few as eleven 
non-identical amino acids, found within a "hypervariable' 1 
peptide segment positioned at the NH 2 -terminus of the enzymes' 
sequence-constant COOH-terminal domains, determines whether or 
not these a(l,3)Fuc-T can utilize type I acceptor substrates 
to form Lewis a and sialyl Lewis a moieties. 
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Cell surface a (1,3)- and a(l, 4) -fucosylated 
oligosaccharides have received a substantial amount of 
attention because some are thought to be essential to the 



initiation of immune cell adhesion to vascular endothelium 
during the inflammatory process (Reviewed in Feizi, T. (1991) 
TIBS, vol. 16, 84-86; Springer, T.A. , and Lasky, L.A. (1991) 
Nature, vol. 349, 196-197; Brandley, B.K., et al (1990) Cell, 
vol. 63, 861-863; Varki, A. (1994) Proc. Natl. Acad. Sci. USA, 
vol. 91, 7390-7397; Lowe, J. B., Carbohydrate recognition in 
cell-cell interaction. Molecular Glycobiology . In: Frontiers 
in Molecular Biology. Fukuda, M. (ed) • Oxford University 
Press, Oxford, UK, 1994; Lowe, J.B., et al (1990) Cell, vol. 
63, 475-484; Phillips, M.L., et al (1990) Science, vol. 250, 



1130-1132; Walz, G. , et al (1990) Science, vol. 250, 1132- 
1137; Goelz, S.E. , et al (1990) Cell, vol. 63, 1349-1376; 
Tiemeyer, M. , et al (1991) Proc. Natl. Acad. Sex USA, vol. 88, 
1138-1142; Larsen, E., et al (1990) Cell, vol. 63, 467-474; 
Polley, M.J., et al (1991) Proc. Natl. Acad. Sci. USA, vol. 
88, 6224-6228; Zhou, Q. , et al (1991) J . Cell Biol., vol. 115, 
557-564; Berg, E.L., et al (1991) J\ Biol. Chem. , vol. 266, 
14869-14872; Takada, A. , et al (1991) Biochem. Biophys. Res. 
Comm., vol. 179, 713-719). This is perhaps best characterized 
by the "rolling" type of adhesion mediated by interactions 
between sialyl Lewis x (sLe x ) -bearing glycoconjugates on 
leukocytes and P- and E-selectin expressed by activated . 
vascular endothelium (Lowe, J. B. Carbohydrate recognition in 
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cell-cell interaction. Molecular Glycobiology. In: Frontiers 
in Molecular Biology. Fukuda, M. (ed) . Oxford University 
Press, Oxford, UK, 1994). Animal intervention studies 
indicate that such fucosylated oligosaccharide molecules can 
act as anti-inflammatory agents by inhibiting leukocyte- 
endothelial cell interactions (Takada, A., et al (1991) 
Biochem. Biophys. Res. Comm., vol. 179, 713-719; Mulligan, 
M.S., et al (1993) Nature, vol. 364, 149-151; Mulligan, M.S., 
et al (1993) J. Exp. Med., vol. 178, 623-631; Buerke, M. , et 
al (1994) *T. Clin. Invest., vol. 93, 1140-1148). Furthermore, 
absence of such molecules on the leukocytes of individuals 
with the rare human leukocyte adhesion II syndrome is 
associated with profound defects in leukocyte-endothelial cell 
adhesion and with non-pyogenic infections (Etzioni, A., et al 
(1992) New Engl. J. Med. vol. 327, 1789-1792; von Adrian U.H., 
et al (1993) J. Clin. Invest., vol. 91, 2893-2897). These 
observations suggest that compounds capable of specifically 
inhibiting leukocyte sialyl Lewis x expression might represent 
candidates for anti-inflammatory pharmacologic agents. 

The a (1,3) fucosyl transferases (a (1,3) Fuc-Ts) represent a 
target for such inhibitory agents, since synthesis and 
expression of the sLe* molecule and related a(l,3)- and a (1,4)- 
fucosylated oligosaccharides are controlled, in large measure, 
by a(l f 3) fucosyltransf erases (a (1,3) Fuc-Ts) (Lowe, J. B. 
Carbohydrate recognition in cell-cell interaction. Molecular 
Glycobiology. In: Frontiers in Molecular Biology. Fukuda, M. 
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ted) . Oxford University Press, Oxford, UK, 1994). These 
enzymes catalyze the attachment of L-fucose in alpha anomeric 
linkage to one or more distinct oligosaccharide precursors. 



Biochemical and molecular cloning studies indicate that the 
human genome encodes at least five distinct a(l,3)Fuc-Ts 
(Goelz, S.E., et al (1990) Cell, vol. 63, 1349-1376; Kukowska- 



Latallo, J.F., et al (1990) Genes Devel., vol. 4, 128B-1303; 
Love, J.B., et al (1991) J. Biol. Chem., vol. 266, 17467- 
17477; Kumar, R. , et al (1991) J. Biol. Chem., vol. 266, 
21777-21783; Weston, B.W., et al (1992) J*. Biol. Chem., vol. 
267, 4152-4160; Weston, B. W. , et al (1992) J. Biol. Chem., 
vol. 267, 24575-24584; Koszdin, K.L. and Boven, B.R. (1992) 

Biochem. Blophys. Res. Common., vol. 187, 152-157; Sasaki, K. , 

i 

et al (1994) J. Biol. Chem., vol. 269, 14730-14737; Natsuka, 
S., et al (1994) J. Biol. Chem., vol. 269, 16789-16794; 
reviewed in Lowe, J. B. Carbohydrate recognition in cell-cell 
interaction. Molecular Glycobiology. In: Frontiers in 
Molecular Biology. Fukuda, M. (ed) . Oxford University Press, 
Oxford, UK, 1994; Macher, B.A., et al (1991) Glycobiology, 
vol. 1, 577-584; Natsuka, S«, and Lowe, J.B. (1994) Curr. 
Opin. Struct. Biol. $ vol* 4, 683-691). Each enzyme can react 
with one or more structurally-distinct oligosaccharides and 
can thereby generate a unique spectra of cell surface 
a (1,3) -fucosylated oligosaccharide products. In turn, these 
molecules may exhibit distinct biologic functions, including 
those involving selectin-dependent cell adhesion. 





f 
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Although the primary sequences are known for several 
ot(l,3)Fuc-Ts (Go el z, S.E., et al (1990) Cell, vol. 63, 1349- 
1376; Kukowska-Latallo, J.F., et al (1990) Genes Devel., vol. 
4, 1288-1303; Love, J.B., et al (1991) J. Biol. Chem., vol. 
266, 17467-17477; Kumar, R. , et al (1991) J. Biol. Chem., vol. 
266, 21777-21783; Weston, B.W. , et al (1992) J. Biol. Chem., 
vol. 267, 4152-4160; Weston, B. W. , et al (1992) *J. Biol. 
Chem., vol. 267, 24575-24584; Koszdin, K.L. , and Boven, B.R. 
(1992) Biochem. Biophys. Res. Commun., vol. 187, 152-157; 
Sasaki, K. , et al (1994) J. Biol. Chem., vol. 269, 14730- 
14737; Natsuka, S., et al (1994) J. Biol. Chem., vol 269, 
16789-16794), the structural determinants within these enzymes 
that dictate their different substrate specificities remain 
undefined. The purpose of this work is to identify such 
protein sequence (s), and to provide a conceptual background 
for work to design or identify molecules that might inhibit 
a(l,3)Fuc-Ts by interacting with acceptor substrate binding 



In this study, we chose to explore three human ot(l,3)Fuc- 
Ts (Fuc-TIII, Fuc-TV, and Fuc-TVI) with informative structural 
and catalytic properties. These enzymes share approximately 
eighty-five percent overall amino acid sequence identity 
(Weston, B.W., et al (1992) J. Biol. Chem., vol. 267, 4152- 
4160; Weston, B. W. , et al (1992) J. Biol. Chem., vol. 267, 
24575-24584) . The COOH-termini of these enzymes maintain 
nearly identical amino acid sequences, while their Nl^-terminal 
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regions are punctuated by foci of non- identical amino acids; 
ve term these regions "hypervariable" segments. These three 
enzymes maintain shared and distinct acceptor substrate 
specificities. In particular, each of the three can 
effectively utilize neutral (GalBl-*4GlcffAc-) and 
a(2,3) sialylated (NeuNAca2-*»3Galfil-»4GlctfAc-) type II acceptors 
to create cell surface Levis x (Le x ) and sLe* determinants, 
respectively, when tested by transf action in cultured cells 
(Weston, B.W., et al (1992) J. Biol. Chem., vol, 267, 4152- 
4160; Weston, B. W. , et al (1992) J. Biol. Chem. , vol. 267, 
24575-24584), and when tested with low molecular weight 
acceptor substrates in in vitro fucosyltransf erase assays. 
Fuc-TIII can also efficiently utilize neutral (GalBl-OGlcNAc) 
and sialylated (NeuffAca2->3Galfil-+3GlcffAc-) type I acceptor 
substrates to generate Lewis a (Le*) and sialyl Lewis a (sLe*) 
molecules (Lowe, J.B., et al 1990) Cell, vol. 63, 475*484; 
Kukowska-Latallo, J.F., et al (1990) Genes Devel., vol. 4, 
1288-1303) in trans fee ted cells, and in vitro. By contrast, 
neither Fuc-TV nor Fuc-TVI directs formation of fucosylated 
type I antigens (i.e., Le* and sLe s ) when tested in trans fee ted 
cells, and neither can efficiently utilize low molecular 
weight acceptor substrates in vitro (Weston, B.W. , et al 
(1992) J. Biol. Chem., vol. 267, 4152-4160; Weston, B. W. , et 
al (1992) J. Biol. Chem., vol. 267, 24575-24584). Thus, the 
three enzymes ' shared, and distinct, catalytic specificities, 
along with similarities in, and differences between, their 
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primary sequences, provide an opportunity to identify amino 
acid residues present in Fuc-TIII and /or absent from Fuc-TV 
and Fuc-TVI, that determine type I oligosaccharide acceptor 
substrate utilization. 

To identify such residues, we studied the pair of enzymes 
that differ most dramatically in their respective abilities to 
utilize type I acceptors (Fuc-TIII versus Fuc-TVI; Figure 9a. 
We divided the amino- terminal "hypervariable" regions of each 
enzyme into five subdomains, each consisting of a distinct 
region of non-identical peptide sequence, separated by a 
region of identity (Fig. 9b) . We then created a series of 
recombinant cr(l,3)Fuc-T chimeras in which these domains, or 
other domains derived from the enzymes' COOH-terminal regions, 



were transposed by themselves, or in combination, between the 
three enzymes, at corresponding positions. Functional 
analysis of these Fuc-TIII : Fuc-TVI chimeras (Fuc-TC's) , and 
their wild type progenitors, via expression in a transfected 
mammalian cell line, indicates that the acceptor substrate 
specificity of these enzymes is determined by peptide 
sequences in the "hypervariable** NH 2 -terminal segments of these 
enzymes. When considered together with results obtained with 
a Fuc-TIII : Fuc-TV chimera, these analyses indicate that type I 
acceptor substrate utilization is determined by as few as 
eleven residues within a 61 amino acid long segment 
immediately proximal to the enzymes' "sequence-constant** COOH- 
terminal domains. 
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The present chimeric enzymes can be used in the same way 
as the other fucosyltransf erases described herein. 



Other features of this invention will become apparent in 
the course of the following description of exemplary 
embodiments which are given for illustration of the invention 
and are not intended to be limiting thereof. 

EXAM PL ES 

Example I. Cloning and Expression of a DNA Sequence 

Encoding a( 1,2) Fucosyltransf erase (DNA SEP ID 
WO: 5. protein SEP ID NO; 6 

Mouse L Cells as a Host for Human (a - 
1, 2) Fucosyl transferase Gene Transfer - House L cells were 
tested as a host for gene transfer. These cells have been 
widely used for this purpose. Genomic DNA may be introduced 
into L cells with high efficiency, and these cells allow the 
use of several metabolic and antibiotic resistance schemes for 
selecting stable incorporation of exogenous DNA sequences. 

L cells were examined for surface expression of the H 
Fuc a (1,2) Gal linkage, using a monoclonal antibody that 
recognizes type II H structures and fluorescence-activated 
cell sorting. Cells stained with this anti-H antibody exhibit 
a FACS profile virtually identical to the profile generated by 
cells stained with a control mouse IgM monoclonal antibody, 
and to profiles generated by cells stained only with 
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FI TO conjugated second antibody. These results indicate that 
L cells do not express surface-localized Fuc a (1,2) Gal 
linkages that are detectable with the anti-H-antibody. 

The inventor assayed L cell extracts to confirm that this 
absence of surface-expressed H determinants was due to a 
deficiency of (a-1,2) fucosyltransf erase activity • 
Phenyl-0-D-galactoside was used as the acceptor for assay of 
(a-1, 2) fucosyltransf erase. This compound is a specific 
acceptor for (a-1,2) fucosyltransf erases; it is used 
efficiently by these enzymes, yet does not function as an 
acceptor for fucosyltransf erases that generate (a-1, 3), (a- 
1,4), or (a-1, 6) linkages. L cell extracts contained no 
detectable (a-1,2) fucosyltransf erase activity, even in assays 
that contained increased amounts of extract, or that were 
subjected to prolonged incubation. Mixing experiments with 
A431 cell extracts showed that inhibitors were not responsible 
for an apparent lack of detectable enzyme activity. 

The inventor also examined L cells for surface expression 
of glyconjugates possessing N-acetyllactosamine 
(Gal/S(l,4)GlcNAc) end groups. These would represent potential 
acceptor molecules for a complementing human 
(a-1,2) fucosyltransf erase activity and would allow the 
resulting surface-expressed Fuca(l,2)Gal linkages to be 
detected with anti-H antibody. The agglutinin from Erythrina 
cristagalli (ECA) was used for this analysis. This lectin 
exhibits high affinity for oligosaccharides possessing one or 
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more unsubstituted N-acetyllactosamine end groups. L cells 
were stained with purified, FITC-labeled EGA, or with 
FITC-labeled EGA that had been preincubated with the hapten 
N-acetyllactosamine and were subjected to FACS analysis. The 
results indicated that significant amounts of ECA bind to 
these cells, and that the binding is effectively inhibited by 
the hapten N-acetyllactosamine. These results are consistent 
with the expectation that L cells synthesize oligosaccharides 
containing N-acetyllactosamine moieties and suggest that some 
of these glycoconjugates remain unmodified and are expressed 
at the cell surface. 

The inventor also tested L cells for the ability to 
synthesize the fucosyltransf erase substrate GDP-fucose. These 
analyses identified both GDP- 1 3 !!] f ucose and GDP-[ 3 H]mannose in 
aqueous extracts prepared from cells labeled with 
[2- 3 H]mannose* The subcellular location of GDP- [*H] fucose in 
these cells cannot be determined from these experiments. 
Effective Golgi fucosyltransf erase activity would presumably 
require the presence of substrate concentrations of GDP-fucose 
within the lumen of the Golgi. Since these cells have not 
been selected to the defective in fucose metabolism, it seemed 
likely that they would be competent to transport this 
cytoplasmically synthesized compound into the Golgi lumen. 
This was confirmed by demonstrating that these cells are able 
to incorporate radio- labeled fucose into membrane 
glycoconjugates; most of this may represent fucose in a (1,6) 
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linkage to the asparagine-linked N-acetylglucosamine of some 
N- linked oligosaccharides. 

Taken together, these studies show that L cells are 
competent to display surface-localized H Fuca(l,2)Gal 
linkages, after introduction and expression of human DNA 
sequences determining (a-1,2) fucosy It ransf erase activity. 

Human A431 Cells as a Donor for (a-1, 2) Fucosy 1 transferase 
DNA Sequences - The human A431 cell line was investigated as a 
source of DNA for gene transfer since these cells express type 
I and II blood group H structures. Extracts prepared from 
A431 cells are found to contain (a-1,2) fucosy ltransf erase 
activity when assayed using phenyl-0-D-galactoside. The 
radiolabeled product elaborated by A431 extracts 

V 

cochromatographed with authentic [ 14 C] fucosy lated phenyl- 
jS-D-galactoside produced by human serum H (a- 
1,2) fucosy ltransf erase. Digestion of the A431 product with 
a-L-fucosidase generated L-fucose in quantitative yield. 
These results indicate that A431 cells contain one or more 
functional (a-1,2) fucosy ltransf erase genes and thus represent 
an appropriate source of human DNA for gene transfer. 

Isolation of a Mouse Transfectant That Expresses Surface 
Molecules Recognized by a Monoclonal Anti-H Antibody - To 
isolate mouse cells containing DNA sequences that determine 
expression of a human (a-l p 2) fucosy ltransf erase, monolayer 
cultures of L cells were cotransf ected with 30:1 ratio of high 
molecular weight genomic DNA prepared from A431 cells and 
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pSV2-neo plasmid DNA. Cotransf ection with pSV2-neo followed by 
growth of the transfected cells in media containing G418 
allows selection of transfectants that have stably 
incorporated exogenous DNA sequences. With this procedure the 
inventor generated a population of cells representing 
approximately 60,000 independent G4i8 -resistant transfectants. 
This method typically incorporates approximately 1000 kb of 
transfected sequences into the genome of a recipient cell. 
Since the size of the human genome is approximately 3 X 10* kb, 
the inventor estimated that approximately 20 copies of the 
haploid human genome were represented within this "library" of 
primary transfectants* 

Transfectants were selected for H antigen expression by a 
combination of panning and sterile cell sorting. A pool of 
cells representing the entire population of transfected cells 
was reacted with a mouse IgM monoclonal antibody that 
recognizes type II H structures. Transfectants with bound 
anti-H antibody were subjected to an initial selection by 
panning on sterile dishes coated with goat anti-mouse IgM. At 
this stage, the inventor found this procedure to be more 
effective than selection by flow cytometry because it allowed 
larger numbers of transfectants to be rapidly and easily 
processed. Transfectants selected by panning were returned to 
culture to be amplified for subsequent rounds of selection. 
The FACS profile of cells present after this first selection 
revealed no obvious peak of cells that had bound the anti-H 
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antibody. However, analysis of the FACS histogram indicated 
that approximately 0.13% of the cells stained more brightly 
than cells stained with the control antibody. Cells 
representing the brightest 3-5% of the total population were 
aseptically collected, returned to culture for 14 days, and 
then reselected by the same procedure.. After three 
selections, PACS analysis revealed the presence of a distinct 
population of cells that were brightly stained with the anti-H 
antibody. These cells were collected and returned to culture. 
For heuristic reasons, transf ectants were also subjected to 
selection by panning, in parallel with the FACS selections. 
The inventor found that the panning procedure more efficiently 
enriched for populations of cells that bound the anti-H 
antibody. This is perhaps because the IgM anti-H antibody 
induced agglutination of H-positive transf ectants and 
interfered with selection by FACS. 

Therefore, all subsequent selections were performed by 
the panning procedure* After three additional rounds of 
panning (representing a total of seven rounds of selection) , 
more than 90% of the cells within the selected population 
stained brightly with anti-H antibody. Clonal isolates from 
this population were generated, and individual subclones were 
analyzed for H antigen expression by FACS. Most clones gave 
rise to phenotypically mixed populations of cells consisting 
of H-expressing and non-expressing transf ectants . The reasons 
for this apparent phenotypic instability are not known. One 
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clone that exhibited a stable, bright H antigen-positive 
phenotype was selected for further analysis (clone mHl-12) . 
The phenotype of clone mHl-12 has remained stable for more 
than 9 months in the absence of selection of H expression. 

The inventor wished to rule out the possibility that a 
murine (a-1, 2) fucosyltransf erase gene might be active in rare 



variants in the L cell population or that the transfection 
procedure itself might activate this gene and that the 
selection process might enrich for these undesired events. 
Therefore, in a parallel control experiment, L cells were 
transfected with high molecular weight genomic DNA prepared 
from L cells, using pSV2-neo as the selectable marker. These 
transfectants were then subjected to selection for H antigen 



expression, exactly as described above. The inventor was 
unable to detect or isolate H-expressing cells from a 
population of independent transfectants (at least 40,000) that 
together has integrated the equivalent of more than 15 copies 
of the haploid murine genome. 

The Primary Trans feet ant Expresses Cell Surface Type II 
Blood Group H Antigen and (o-l, 2) Fucosyltransf erase Activity - 
Clone mHl-12 was selected with a monoclonal anti-H antibody 
that recognizes type II blood group H structures 
(Fuca(l,2)Gal/3(l,4)GlcNAc-R) . Binding of this antibody to 
mHl-12 cells is blocked when the antibody is preincubated with 
the type II H hapten 2 '-fucosyllactose 

(Fuca(l,2)Galj3(l,4)Glc) . By contrast, preincubation of the 
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anti-H antibody with L-fucose, or with N-acetyllactosamine or 
lactose, at identical concentrations, does not inhibit binding 
of the antibody to mHl-12 cells. When a different monoclonal 
anti-H antibody (BE2) previously shown to be specific for type 
II H structures was used in these experiments , the inventor 
also observed inhibition of binding with 2'-fucosyllactose, 
but not with the other haptens. These studies indicate that 
mHl-12 cells express cell surface glycoconjugates with 
terminal Fuca(l,2)Gal linkages. 

Additional evidence for the presence of this linkage was 
obtained by using the linkage-specific blood group A 
(oc-l,3)GalNAc transferase purified from human plasma. This 
gly cosy ltransf erase has an absolute requirement for blood 
group H acceptors containing fucosyl a(l,2) galactoside as the 
terminal nonreducing group. It catalyzes the addition of N- 
acetylgalactosamine in a-1,3 linkage to the galactose moiety 
of this structure to construct blood group A-reactive 
molecules of the form GalNAa(l,3) [Fuca(l,2) ]Gal. Generation 
of blood group A-reactive determinants on the surface of 
mHl-12 cells by the action of blood group A 

glycosyltransferase would provide confirmation of the presence 
of terminal Fuca(l,2)Gal linkages inferred by the results of 
the type II H hapten inhibition study. 

Formalin-fixed mHl-12 cells were incubated with 
preparations of the blood group A (oe-1, 3)GalNAc transferase 
and its nucleotide sugar substrate (UDFGalNAc, l mM, 
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approximately 20-fold above K m for UDP-GalNAc) in a buffer 
supporting activity of this enzyme. The cells were then probed 
for the presence of newly synthesized, surface-localized blood 
group A determinants by indirect immunofluorescence using a 
monoclonal anti-A antibody. 

After a 4-h incubation with the A (cr-1,3) GalNAc 
transferase and its substrate, blood group A determinants were 
detectable on the surface of the cells. No staining with 
anti-A antibody was observed in control reactions done in the 
absence of either UDP-GalNAc or group A transferase. L cells 
showed no binding of anti-H or anti-A under any of these 
conditions • 

The inventor also stained A enzyme-treated cells with 
anti-H antibody to test for loss of surface expressed H 
structures. These should be "masked" with 
M-acetylgalactosamine molecules that are attached by the A 
enzyme to the galactose of the Fuca(l,2)Gal linkage. After a 
4-h incubation with both A enzyme and its substrate, the 
staining generated by the anti-H antibody was only slightly 
diminished. However, after the conversion reaction was 
allowed to proceed for 24 h, essentially complete elimination 
of cell surface H reactivity was seen. This is coincident 
with continued expression of strong A reactivity. Cells 
treated for 24 h with a control reaction mixture containing 
the A enzyme but without substrate exhibited strong anti-H 
staining. This indicates that loss of H reactivity seen after 
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the 24-h reaction was not due to destruction of H structures 
by glycohydrolase or protease activities contaminating the 
group A enzyme preparation. Loss of H reactivity after 
prolonged incubation thus represents "masking" of H structures 
by the A enzyme-catalyzed attachment of a -1,3 -linked N- 
acetylgalactosamine. These data indicate that mHl-12 cells 
express cell surface glycoconjugates terminating with 
authentic H Fuca(l,2)Gal linkages. 

Assays of extracts prepared from mHl-12 cells confirmed 
that these cells express or-(l,2) fucosyltransferase activity. 
a-Fucosidase digestion of the fucosylated product of these 
reactions confirmed the cr anomeric linkage of the attached 
fucose. 

Analysis of the Human DNA Sequences in the Primary 
Trans fectant - Southern blot analysis was used to determine if 
the mHl-12 cell line contains human DNA sequences. The BLUR8 
Alu sequence was used to detect human sequences. With the 
hybridization and washing conditions used, the human Alu probe 
did not cross-hybridize with mouse sequences, but was able to 
detect the equivalent of a few copies of an Alu sequence that 
had been added to mouse L cell DNA (10 pg) . By comparison, 
the A431 DNA sample (3 ng) displayed a diffuse yet relatively 
intense hybridization signal expected for the highly 
repetitive interspersed Alu sequences. Under these 
conditions, the inventor was able to detect significant 
amounts of human sequences in the genome of mHl-12 cells (500 
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ng) . This analysis indicates that, as expected, the genome of 
mHl-12 cells contains roughly 1000 kb of human DNA. 

Isolation of Multiple Secondary Transfectants That 
Express Cell Surface Type II H Antigen and (a- 
1,2) Fucosyltransf erase Activity - The inventor wished to be 
able to identify within the large number of human sequences in 
the genome of mHl-12 cells specific human sequences that 
mediate expression of its H-positive phenotype. To reduce the 
amount of extraneous human DNA, the inventor used DNA prepared 
from the mHl-12 cell line to generate "secondary" trans fectant 
"libraries," and screened these libraries for transfectants 
that expressed the H structure. The inventor expected that 
H-expr ess ing secondary transfectants so identified would each 
have a small number of human DNA restriction fragments 
identifiable within their genomes. The inventor sought to 
isolate several independent secondary transfectants since it 
was anticipated that human sequences linked to H-determining 
gene(s) should be a subset of these human fragments, 
identifiable as characteristic restriction fragments of 
identical sizes in each independently derived H-expressing 
secondary trans fectant. 

Genomic DNA prepared from mHl-12 cells was contransfected 
with pSV2-neo into L cells. Four different secondary 
libraries were generated in this way (Table I) . 
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Table I 

Estimated frequencies of H antigen-positive 
transfectants in six independent libraries 

Frequencies are expressed as one independent H-expressing 

transfectant isolated /number of plates of transfectants 

screened. For libraries screened by cell sorting or panning, 

this is a minimum estimate since these immunoselection 

procedures do not allow discrimination between H-expressing 

sibs and independently derived H -positive transfectants; at 

least one independent H-expressing transfectant was present in 

each of these libraries* For the primary library mHl, and 

secondary libraries mHsl, mHs3, mHs4, and mHs5, each plate 

contained approximately 2000 independent transfectants, as 

determined from transfection efficiency estimates (see 

w Experimental Procedures' 9 ) . For the mHs2 secondary library, 

inspection of the plates prior to screening indicated that 

approximately 50 colonies were present on each plate. Clones 

s2-l and s2-2 were isolated with the rosette procedure from 

two separate plates. Clone s2-3 was isolated by panning a 

population of cells representing approximately 650 independent 

transfectant colonies. 
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Library name 


Source of 
transf ected 
DNA 


Added 
selection 
plasmid 


Fraction of 
transfeetants 
expressing H 
determinants 


mHl (primary) 
mHal (secondary) 
mHB3 ( secondary) 
mHs 4 ( secondary ) 
mHsS ( secondary) 
mHs2 ( secondary ) 


A431 Cells 
mHl-12 
mHl-12 
mHl-12 
mHl-12 
mHl-12 


pSV2-neo 
pSV2-neo 
pSV2-neo 
pSV2-neo 
pSV2-neo 
None 


2 1/30 dishes 
2 1/30 dishes 
2 1/10 dishes 
2 1/10 dishes 
2 1/10 dishes 
2 1/-650 colonies 
2 1/-50 colonies 
(rosette s2-l) 
2 1/-50 colonies 
(rosette 82-2) 



Each library was independently screened for H-expressing 
transfeetants using the panning procedure. Prior to the third 
round of panning, FACS analysis indicated that each of these 
libraries contained H antigen-positive cells (1-60% of the 
cells bound anti-H antibody) . Sequential selection by panning 
was continued until 50-90% of the cells exhibited the H — 
positive phenotype. Clonal cell lines (Sl-11 and S3-6) were 
then established from populations derived from the mHsl and 
mHs2 libraries; more than 95% of the cells in these lines 
exhibited bright staining with anti-H antibody. Libraries 
mHs 4 and mHs5 were not subjected to the cell cloning procedure 
but were instead subjected to additional selections by 
panning. After a total of 11 rounds of selection, 
approximately 95% (selected from mHs4 library) and 50% 
(selected from mHs5 library) of these cells exhibited H 
antigen expression. 
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The calcium phosphate transfection procedure the inventor 
used for constructing the mHl-12 cell line occasionally 
results in physical linkage of selectable plasmid sequences 
with the trans feet ed genomic DNA sequences that determine the 
desired phenotype. Linkage of pSV2-neo sequences to human DNA 
sequences determining the H-positive phenotype in the mHl-12 
primary trans feet ant would simplify identification of 
H-expressing secondary transfectants and would facilitate 
isolation of the relevant transf ected sequences by molecular 
cloning procedures. As a test of such linkage, the inventor 
generated a secondary library, mHs2, by transf ecting L cells 
with DNA prepared from the mHl-12 primary transf ectant. The 
transfection was done without the addition of exogenous 
pSV2-neo DNA. This procedure yielded approximately 50 
independent 6418 transfectants on each of fifteen 10-cm 
dishes. This represents a 40-fold reduction in the number of 
6418-resistant colonies obtained, relative to the numbers 
generated when secondary libraries were generated with the 
addition of pSV2-neo DNA (-2000 G418 resistant cells/dish) . 

This mHs2 library was initially screened with an in situ 
rosette procedure for rapid identification of transf ectant 
colonies that bound anti-H antibody (see "Experimental 
Procedures*' below) . Culture dishes containing approximately 
50 colonies each were screened with this method 16 days after 
transfection and prior to any other manipulations. A single 
rosette-positive colony was identified on two of the 15 dishes 
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tested. These two independent H-positive colonies were 
isolated with cloning rings and an H-expressing cell line 
(s2-l and s2-2) was established from each. An additional, 
independent, clonal H-expressing transfectant (s2-3) was 
isolated by harvesting the colonies on the other 13 dishes and 
subjecting these cells to selection by panning and then cell 
cloning. 

Cotransf ection of unlinked single-copy markers occurs at 
a frequency less than 1%. The frequencies of coexpression of 
G418 resistance and the H phenotype the inventor observed in 
the mHs2 library (Table I) are consistent with the possibility 
that the two markers are linked in the primary transfectant. 
Alternatively , these frequencies could be explained by 
cotransf ection of unlinked markers present in multiple copies 
in the primary transfectant. In any event, the frequencies of 
H-expressing transf ectants observed in the primary and 
secondary libraries (Table I) indicate that the H-positive 
phenotype expressed by these transf ectants is determined by a 
single transf ected locus. 

The anti-H reactive surface molecules on a representative 
H-expressing secondary transfectant (clone s2-2) were shown to 
be authentic H Fuca(l,2)Gal linkages, using analyses identical 
to those used for the H antigen-positive primary transfectant 
mHl-12. Extracts prepared from s2-2 cells were found to 
contain (a-1, 2) fucosyltransf erase activity. 

(a-l,2)Fucosyltransferase activity was also found in extracts 
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prepared from each of the other H- expressing secondary 



Independent H-Expressing Secondary Transfectants Have 
Common Restriction Fragments Containing Human DNA Sequences - 
The inventor anticipated that the genome of each H-positive 
secondary transfectant would contain a relatively small amount 
of human DNA and that this DNA would include human sequences 
controlling expression of the (a-1, 2) fucosyl transferase found 
in each. In principle, one or more characteristic restriction 
fragment (s) generated by these sequences should be 
identifiable in every transfectant. Conversely, irrelevant 
human sequences should exhibit a random restriction pattern in 
the secondary transfectants. The inventor therefore isolated 

i 
\ 

genomic DNA from each transfectant, digested these DNAs with 
various restriction enzymes, and subjected these digests to 
Southern blot analysis. Restriction fragments containing 
human DNA sequences were detected with the BLUR8 Alu probe. A 
number of DNA restriction fragments are present in each clonal 
secondary transfectant; the aggregate amount of human genomic 
DNA present in these cells is estimated to be between 25 and 
55 kb. The genome of each clonal secondary transfectant 
contains a characteristic pair of human DNA EcoRI restriction 
fragments with sizes of 2,7 and 3.4 kb. These fragments are 
also evident in pools of cells selected by panning from 
libraries mHs4 and mHs5. Similar analyses indicate that the 
genome of each H-expressing secondary transfectant contains 
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common 1.5 and 1.9 kb PstI fragments and a common 2.8-kb PvuII 
human DNA restriction fragment. These observations imply that 
DNA sequences within or linked to these characteristic human 
restriction fragments are associated with expression of the 
cell surface H Fuca(l,2)Gal linkages used to select these 
trans feet ants, and are thus implicated in the expression of 
the (a-1 , 2) fucosyl transferase found in these cells. 

To further confirm that the common human DNA sequences in 
the transf ectants direct (crl, 2) fucosyltransf erase expression 
in these cells, molecular cloning procedures were used to 
isolate these fragments and then test their function in a 
mammalian transient expression system. The two human DNA 
EcoRl fragments previously found to be associated with 

\ 

expression of the H phenotype in the H- expressing secondary 
transf ectants were isolated from mini genomic libraries 
prepared in a lambda phage vector, using the secondary 
transf ectant s2-2 (see " Experimental Procedures"). To 
determine if these fragments contained sufficient genetic 
information to direct synthesis of (al, 2) fucosyltransf erase, 
these were first individually subcloned into the mammalian 
expression cosmid vector pWEl5. This vector contains the SV40 
origin of replication, enabling it to replicate efficiently as 
an episome in COS-1 cells. The resulting plasmids contained 
either the 3.4 kb EcoRX (plasmid pH3.4) or the 2.7 kb EcoRI 
fragment (plasmid pH2.7). These plasmids were then 
individually introduced into COS-1 cells by DEAE-dextran 
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transfection (see "Experimental Procedures 19 ) and the 

transfected cells were subsequently assayed for 

(al, 2) fucosyl transferase activity. The inventor found no 



detectable (al,2) fucosyltransf erase activity in mock 
transfected COS-1 cells or in COS-1 cells transfect with 
pH2.7. However, COS-1 cells transfected with plasmid pH3.4 
expressed significant amounts of (al, 2) fucosyltransf erase 
activity. a-Fucosidase digestion of the fucosylated phenyl-0- 
D galactoside product generated by this extract confirmed the 
alpha anomeric configuration of the attached fucose (see 
"Experimental Procedures 91 ) . 

The pH-activity profile of this (al, 2) fucosyltransf erase 
mirrors the profiles determined for the 



(al, 2) fucosyltransf erases found in fractionated human serum, 
in A431 cells, and in the H-expressing mouse transf ectants . 
Likewise, the apparent Michaelis constants exhibited by the 



as those determined by the inventor for the 

(al, 2) fucosyltransf erases in fractionated human serum, and in 
each of the cell lines he analyzed. Considered together, 
these results are consistent with the proposal that human DNA 
sequences within the 3.4 kb EcoRI fragment encode an 
(al, 2) fucosyltransf erase r and that these sequences encompass 



part or all of the human blood group H 

(al, 2) fucosyltransf erase gene. The 3.4 kb EcoRI fragment in 



recombinant enzyme expressed in COS-1 cells (GDP-fucose 1^=17.5 
/iM; phenyl-0-D-galactoside K m =4.4 mM) are essentially the same 
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pH3.4, the 2.7 kb EcoKI fragment in pH2.7, and DNA sequence 
adjacent to the 3.4 kb fragment, were sequenced to provide SEQ 
ID NO: 5. 

To characterize the nature of these cloned human genomic 
DNA sequences r the inventor first isolated various restriction 
fragments from the insert in plasmid pH3.4 and tested these 
for their ability to identify transcripts in the H-expressing, 
stable transf ectants , and in a human cell line (A431) that 
also expresses H determinants and a cognate 
(al, 2) fucosyl transf erase. He found that a 1.2 kb Hinfl 
restriction fragment from the insert in pH3.4 identifies a 
single, relatively non-abundant 3*6 kb transcript in A431 
cells. This probe also detects transcripts in the H- 



expressing mouse L cell transf ectants, but not in the non- 
trans feet ed parental L cells. 

A cloned cDNA that directs expression of cell surface H 
structures and an fal. 21 fuc osvltransf erase. The inventor used 
the 1.2 kb Hinfl fragment and colony hybridization to isolate 
two hybridization positive cDNA clones from an A431 cell cDNA 
library. To test the cloned cDNAs for their ability to 
determine expression of surface-localized H antigen and a 
cognate (al,2)fucosyltransferase activity, a plasmid was 
constructed (pCDM7-a(l,2)FT, see "Experimental Procedures") 
that consisted of the largest cDNA insert cloned into the 
mammalian expression vector pCDM7, in the sense orientation 
with respect to the vector's enhancer-promoter sequences. 
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Flow cytometry analysis of COS-1 cells transfected with FCDM7- 
a (1,2) FT indicates that this cDNA determines expression of 
cell surface H molecules. Moreover, COS-1 cells transfected 
with pCDM7-a-(l,2)FT, but not cells transfected with pCDM7, 
express substantial quantities of an (otl,2) fucosyltransf erase 
activity. The inventer determined the apparent Michaelis 
constant exhibited by this (al, 2) fucosyltransf erase for an 
artificial acceptor (phenyl-0-D-galactoside) that is specific 
for this enzyme and that can discriminate between the human H 
and Se (al, 2) fucosyltransf erases. This apparent Km (2.4mM) is 
nearly identical to the apparent Km the inventor (3.1 mM r ) and 
others (4.6 IBM, 6.4 mH, 1.4 mH) have determined for the blood 
group H (al, 2) fucosyltransf erase. Moreover, this apparent Km 
is also very similar to the one exhibited by the 
(al, 2) fucosyltransf erase in extracts prepared from COS-1 cells 
transfected with pH3.4 (4.4 mM) • This apparent Km is distinct 
from the one exhibited by an (al, 2) fucosyltransf erase found in 
human milk enzyme (15.1 mM) , that is though to represent the 
(al, 2) fucosyltransf erase encoded by the Se locus. These data 
demonstrate that the cDNA in plasmid pCDM7-a(l, 2) FT determines 
expression of an (al, 2) fucosyltransf erase whose kinetic 
properties reflect those exhibited by the human H blood group 
( al , 2 ) fucosyltransf erase . 

The cDNA sequence predicts a Type II transmembrane 
glycoprotein. The cDNA insert in pCDM7-a(l,2)FT is 3373 bases 
pairs long. Its corresponding transcript is 3.6 kb in length, 
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suggesting that this cDNA is virtually full-length. Two 
potential initiator codons are found within its first 175 
nucleotides. Only the second of these, however, is embedded 



within a sequence contest associated with mammalian 
translation initiation. The methionine codon initiates a long 
open reading frame that predicts a protein of 365 amino acids 
(SEQ ID NO: 6), with a calculated Mr of 41,249 Da. This open 
reading frame is colinear with the open reading frame found in 
the 3.4 kb EcoBl fragment in pH3.4. Hydropathy analysis of 
the predicted protein sequence indicates that it is a Type II 
transmembrane protein, as noted for several other cloned 
glycosyltransf erases . This topology predicts an 8 residue NHj- 
terminal cytosolic domain, a 17 residue hydrophobic 



transmembrane domain flanked by basic amino acids, and a 340 
amino acid COOH-terminal domain that is presumably Golgi- 
resident and catalytically functional. Two potential N- 
glycosylation sites are found in this latter domain suggesting 
that this sequence, like other glycosyltransf erases, may exist 
as a glycoprotein. No significant similarities were found 
between this sequence and other sequences in protein or DMA 
databases (Protein Identification Resource, Release 21.0, and 
Genbank, Release 60.0), with the exception of a 642 bp 
sequence within the 3 ' -untranslated segment of the cDNA that 
is similar to the human Alu consensus sequence. Moreover, the 
inventor identified no significant sequence similarities 
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between this cONA sequence or its predicted protein sequence, 
and those of other cloned glycosyltransf erase cDNAs. 

The protein encoded bv the cDNA is an 
(al. 2) fucosvltransf erase. The results of the expression 
experiments present above, when considered together with the 
domain structure predicted by the cDNA sequence, are 
consistent with the presumption that it encodes an 
(ctl , 2) fucosyl transferase. Nonetheless, the inventor wished to 
directly confirm this, and thus exclude the possibility that 
it instead encodes a molecule that trans -det ermines this 
enzyme activity* The inventor therefore fused the putative 
catalytic domain of the predicted protein to a secreted form 
of the IgG-binding domain of Staph, aureus protein A (see 
Experimental Procedures) in the mammalian expression vector 
pPROTA, to yield the vector pPROTA-a ( 1 , 2 ) FT C . By analogy to 
similar constructs the inventor has prepared with other cloned 
glycosyltransf erases (detailed Infra ) , he expected that if the 
cDNA sequence actually encodes an (al, 2) fucosyltransf erase, 
then plasmid pPROTA-a ( 1 , 2 ) FT C would generate a secreted, 
soluble, and affinity purifiable (otl, 2) fucosyltransf erase. 
Indeed, conditioned media prepared from a plate of COS-1 cells 
trans fee ted with pPROTA-a (1,2) FT C contained a total of 5,790 
units of (al, 2) fucosyltransf erase activity, whereas a total of 
1,485 units were found to be cell-associated. Moreover, 
virtually 100% of the released (al, 2) fucosyltransf erase 
activity was specifically retained by IgG-Sepharose, and most 
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could be recovered after exhaustive washing of this matrix. 
By contrast, the inventor found that most of the activity in 
COS-1 cells transfected with pCDH7-ot(l,2)FT was cell- 
associated (3450 units), with only trace amounts of activity 
in the conditioned media prepared from these cells (-80 
units) . Virtually none of this latter activity bound to 
either matrix. Extracts prepared from COS-l cells transfected 
with vector pCDM7 or vector pPROTA did not contain any 
detectable cell-associated or released 

(crl,2) fucosyl transferase activity. These data demonstrate 
that the cDNA insert in pCDM7-a(l,2)FT encodes an 
(crl,2) fucosyltrans f erase, and that information sufficient to 
generate a catalytically active (al,2) fucosyl transferase is 
encompassed with the 333 amino acids distal to the putative 
transmembrane segment. 

Experimental Procedures foi; pya^nple T f "Cloning and 
Expression of a DNA Sequence Encoding 
( ttl . 2 ) Fucosvltransf erase" . 

The term "L cells" used throughout the text refers to the 
mouse Laprt'tk" cell line. 

Lactose , N-acetyllactosamine , 2 ' -f ucosy llactose 
(Fuca(l,2) Gal0(l,4)Glc) , UDP-GalNAc, phenyl-/3-D-galactoside, 
and Ficoll 400 were obtained from Sigma. L-Fucose was from 
Pfanstiehl Labs (Waukegan, IL) . UDP[1- 3 H ^-acetylgalactosamine 
(8.7 Ci/mmol) and D-[U- 14 C]mannose (239 mCi/mmol) were from 
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DuPont-New England Nuclear. D-[2- 3 H]mannose (16.3 Ci/mmol), 
L-[6- 3 H]fucose (72 Ci/mmol), L-[1- U C] fucoBe (58.7 mCi/mmol) , 
GDP[U- u C]-0-L-fucose (268 mCi/mmol) , and [a- 32 P]dCTP (3000 
Ci/mmol) were from Amersham Corp. Nonradioactive GDP-fucose 
was kindly provided by Dr. Eric Holmes (Seattle). FITC-ECA 
was obtained from E-Y Labs (San Mateo, CA) . PI asm id pSV2-neo 
was obtained from Dr. David Chaplin (Washington University, 
St. Louis) . Restriction enzymes (New England Biolabs or 
Boehringer Mannheim) were used according to the manufacturer's 
instructions • 

Antisera : 

Monoclonal anti-H, anti-A, and anti-B antibodies (mouse 
IgM) were purchased from Chembiomed, Ltd. (Alberta, Canada) . 
Monoclonal anti-H antibody BE2 was prepared from BE2 hybridoma 
cell culture supematants (see below) . Ant i -mouse IgM and 
FITC-labeled antimouse IgM (both antigen-affinity purified, 
goat) were from Sigma. 

Cell Lines and Culture; 

Mouse Laprt'tk" cells were obtained from Dr. David 
Chaplin. Human A431 cells were from Dr. Brian Whiteley and 
Dr. Luis Glaser (Washington University, St. Louis) . BE2 cells 
were obtained from the American Type Culture Collection. 
Cells were grown in Dulbecco's modified Eagle's medium (6IBC0) 
supplemented with 10% fetal calf serum (Hyclone, Logan, UT) . 
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Transfected cells were grown in media containing G418 (6IBC0) 
at 400 pg/ml (active drug) . 



Preparation of Genomic DNA; 

High molecular weight genomic DNA was prepared from 
cultured cells by standard methods. Samples of genomic DNA 
were electrophoresed through 0.3% agarose gels buffered in 
Tris-acetate-EDTA to confirm their integrity and to estimate 
the average size of the molecules in the preparations. 



Transfections: 

The calcium phosphate precipitation method was used to 
transfect mouse L cells with human genomic DNA. Cells (5 X 
10 5 /100-mm dish) were incubated overnight with DNA precipitates 
20-30 /ig of genomic DNA and lpg of pSV2-neo) . No exogenous 
pSV2-neo DNA was included in transfect ions that generated the 
mHs2 secondary library. The cells were fed fresh media the 
following day and were placed under G418 selection the next 
day* Transfection efficiencies were estimated by harvesting 
transfected cells 1 day after addition of DNA and plating 
duplicate serial dilutions of the cell suspensions. One set 
of dilutions was grown under G418 selection, and the other was 
grown in the absence of antibiotic to allow the derived 
transfection efficiencies to be corrected for plating 
efficiency. After 2 weeks of growth, colonies were counted 
after staining with 0.2% methylene blue in 50% methanol. 
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Approximately 2000 independent transf ectants were typically 
obtained by transfecting 5 X 10 s cells on a lOO-mrn dish. 

Immunologic Selection of H-expressincr Transf ectants t 

Transf ectants were removed from culture dishes by 
incubating them with PBS containing 3 mM EDTA. Detached cells 
were washed and resuspended in staining medium (10 mM Hepes, 
pH 7.4, 0.1% sodium azide, 2% fetal calf serum in Dulbecco's 
modified Eagle's medium). The cells were kept at 4°C 
throughout the panning or cell sorting procedures. Cell 
cloning was done by plating cells at low density! allowing 
individual cells to form colonies, and isolating individual 
colonies with cloning cylinders. 

Panning - Bacteriological culture dishes (Falcon 1007, 60 
mm) were prepared for panning. Goat antimouse IgM was coupled 
to the dishes by incubating them overnight at 4°c with 4 ml of 
antibody solution diluted to 10 jig/ml in 50 mM Tris, pH 9.5. 
The antibody solution was aspirated, and the dishes were 
washed twice with PBS. The dishes were then blocked by 
incubating them at room temperature for at least 1 h with PBS 
containing 1 mg/ml bovine serum albumin. Dishes were then 
used immediately or were stored indefinitely at 4°C. The 
dishes were washed three times with PBS prior to use. 

Cells to be panned were resuspended at a concentration of 
10 7 /ml # in staining media containing anti-H antibody at 10 
pg/ml. The cells were incubated for 30 min at 4°C, and unbound 
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antibody was removed by pelleting the cells through 10 ml of 
PBS containing 1 mM EDTA, 0.1% sodium azide, and 2% Ficoll 
400. After centrifugation, the supernatant was carefully 
aspirated, and the cells were resuspended at 10 6 /ml, in 
staining media. Three-ml aliquots of this cell suspension 
were applied to 60 -mm panning dishes coated with goat 
anti-mouse IgN. The dishes were then incubated for 1 h at 
4°C, and were then rinsed 5 times with serum-free Dulbecco's 
modified Eagle's medium to remove nonadherent cells. Fresh, 
serum-replete media was added to the dishes, and they were 
returned to the tissue culture incubator. The next day, 
adherent cells were removed with trypsin-EDTA and were 
replated on standard tissue culture dishes. These cells were 
grown for 10-18 days prior to the subsequent selection. 

Cell Sorting - Transf ectants were prepared for FACS 
analysis by incubating them for 30 min at 4°C with monoclonal 
IgM anti-H antibody (10 pg/ml in staining media) or with a 
control monoclonal IgM anti-B antibody (10 in staining 

media) ♦ The cells were then washed in ice-cold staining 
media, and incubated for 30 min at 4°C in staining media 
containing f luorescein-conjugated goat antimouse IgM at 40 
jig /ml. The cells were washed, resuspended in staining media, 
and subjected to analysis by the FACS (Coulter Electronics 
model Epics C) . Samples were gated on forward and 90° light 
scatter to eliminate dead cells from analysis. H-expressing 
cells were collected aseptically into staining media and then 
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returned to culture for 10-18 days before additional 
selections* 

Rosette Procedure - A rosetting method was used to 
identify colonies of transfectants that bound anti-H antibody* 
This was done on 100-mm dishes containing isolated colonies 
comprised of approximately 100-300 cells. Plates of colonies 
were first rinsed twice with PBS and were then incubated for 1 
h at 4°C with 4 ml of mouse IgH monoclonal anti-H antibody at 
10 /jg/ml in PBS, 2% fetal calf serum, 0.1% sodium azide. The 
plates were then rinsed three times with PBS, and were 
incubated for 30 min at 4°C with 4 ml of human erythrocytes 
conjugated with goat anti-mouse IgM. (Goat anti-mouse IgM was 
coupled to human blood group O red cells with chromic 
chloride. After conjugation, the red cells were washed with 
PBS, diluted to a 0.2% v/v suspension in PBS, 2% fetal calf 
serum, 0.1% sodium azide, and were used immediately.) 
Afterwards, the suspension of erythrocytes was carefully 
aspirated, and the plates were gently rinsed with PBS and 
examined on a light box. Colonies that had bound anti-H 
antibody were macroscopically visible as "rosettes 11 consisting 
of red cells adherent to the colonies. 

Purification of Blood Group A (tt-l.31GalNAc Transferase ; 

Group A transferase was isolated from human blood group A 
plasma by affinity chromatography on Sepharose 4B (Sigma lot 
no. 104F0334). Column fractions containing the peak of enzyme 
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activity were pooled, mouse serum albumin (Behring Diagnostics 
>98%) was added to a 1% concentration, and aliguots were 
stored at -80 °C until use. The activity of the final 
preparation was determined by a standard radiochemical method. 
One enzyme unit is defined as one nmol of GalNAc transferred 
to 2 '-fucosyllactose acceptor /h. 



Paper Chromatography : 

Descending paper chromatography was performed using 
Whatman No. 3mm or Whatmann No. 1 in the following solvent 
systems: Solvent A, ethyl acetate/pyridine/water (10:4:3); 
Solvent B pyridine/ water/ ethyl acetate (10:11:5:36), upper 
phase. The dried chromatograms were cut into 1-cm strips and 
radiolabeled compounds were eluted with water. Radioactivity 
in an aliquot of each eluate was determined by scintillation 
counting. 

Analysis of L Cell GDP-Fucose Content : 

To identify GDP-fucose in mouse L cells, cells (2.5 X 10 6 ) 
were labeled for 3 days with 250 MCi of D-[2- 3 H]mannose in 30 
ml of complete media. Cells were harvested and extracted with 
60% ethanol for 5 min in a boiling water bath. The aqueous 
extract containing nucleotide sugars was concentrated under 
vacuum and resuspended in a small volume of water. Unlabeled 
GDP-mannose (270 nmol) and GDP-fucose (130 nmol) were added as 
internal standards, as the mixture was subjected to gel 
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filtration chromatography on a Sephadex G-25 column (0.9 X 
42 -cm) equilibrated in 50 mM ammonium acetate. The eluate was 
monitored at 268 run; fractions containing GDP-fucose and 
GDP-mannose were pooled, concentrated under vacuum, and 
resuspended in 200 pi of water. This was subjected to 
fractionation by HPLC on a weak anion exchange column (AX300, 
4 mm X 24 cm, Pierce Chemical Co.)- The sample was applied to 
the HPLC column equilibrated in 100 mM triethylamine acetate, 
pH 7.0, and was eluted with a linear gradient from 100 to 300 
mM triethylamine acetate, pH 7.0 in 50 min at a flow rate of 2 
ml /min. The eluant was monitored at 268 nm, and fractions (0.5 
ml) were collected for scintillation counting and subsequent 
analysis. The unlabeled nucleotide sugar internal standards 
were identified by their characteristic elution times 
(GDP-fucose, 35.5 min. approximately 800 cpms; GDP-mannose, 
33.0 min. approximately 1000 cpms). The radioactive peaks 
corresponding to the fractions coeluting with unlabeled 
GDP-fucose and GDP-mannose were subjected to hydrolysis with 
0.1 N HC1 for 45 min at 100°C. These were then fractionated 
by descending paper chromatography on Whatman No. 3MM in 
solvent B for 20 h, in parallel with L-[ u C]fucose and D-[ 14 C] 
mannose standards. In each case, approximately 30% of the 
counts hydrolyzed were recovered as the appropriate 
monosaccharide • 
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Analysis of Fucose-labeled Glvcopeptides : 

Radiolabeled glycopeptides were prepared and analyzed. 
House L cells (2 X 10 6 ) were labeled for 3 days with 200 /iCi of 



L-[6- 3 H] fucose in 20 ml of complete media. Cells were 
harvested and extracted with chloroform, and then water r and 
the pellet remaining after the final extraction was subjected 
to exhaustive digestion with Pronase (Behring Diagnostics) . 
This was then desalted by gel filtration chromatography on 
Sephadex G-25-80. This material was concentrated under vacuum, 
and an aliquot was hydrolyzed in 0.1 N HCl at 100°C for 45 
min. The hydrolysate was then subjected to descending paper 
chromatography on Whatman No. 3HH in solvent B for 20 h, in 
parallel with an L-[ U C] fucose standard. Approximately 26% of 



the counts present in the macromolecular material were 
released and cochromatographed with the radiolabelled fucose 
standard . 

Assay of GDP-L-Fucoseifl-D-Galactoside 2-ce-L- 

rucpsv Ityangf erase ; 

Cultured cells were washed with PBS, pelleted by 
centrifugation, and resuspended in a small volume of 25 mM 
sodium phosphate, pH 6.1, containing 0.5% Triton X-100. 
Volumes were adjusted to achieve a protein concentration of 
approximately 5 mg/ml (BCA method, Pierce Chemical Co.). 
Extracts were typically assayed immediately after preparation. 
The standard assay contained 5-20 pi of enzyme solution 
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(typically 30-100 *xg of cell extract protein or 15 /xl of 
serum) in 40 ^1 of 25 mM potassium phosphate, 0.1% Triton 
X-100, 3 MM GDP-[ 14 C]fucose, 25 mM phenyl-0-D-galactoside, and 
5 mM ATP. The pH of the reaction mixture was adjusted to a 
final measured pH of 6.1. Assays were terminated after an 
appropriate period of time by the addition of 20 jil of 
ethanol. The mixture was then centrifuged at 15,000 X g for 5 
min. The supernatant was collected, spotted on Whatman No. 1, 
and subjected to fractionation by descending paper 
chromatography for 4 h in Solvent A. Radioactivity was then 
determined as described above. In all cases, parallel 
reactions were done in the absence of added phenyl-0- 
D-galactoside acceptor to allow correction for endogenous 
acceptor molecules. No products of endogenous acceptor 
molecules cochromatographing with fucosylated phenyl- 
0-D-galactoside were identified in any samples. 

g-Fucosidase Digestion ; 

[ 14 C]Fucosylphenyl-0-B-galactoside (approximately 10,000 
cpm) was isolated from fucosyltransferase assays by paper 
chromatography, concentrated from the water eluate under 
vacuum, and resuspended in 5 /il of water. This was digested 
with 0.025 units of bovine kidney a-L-fucosidase (EC 3,2.1.51) 
in a final volume of 20 /il containing 5 mM sodium citrate, pH 
6.0, at 37 °C for 1 h. This mixture was then fractionated by 
descending paper chromatography on Whatman No. 1 in Solvent A 
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for 4 h. Products of the digestions were identified by 
comparison to parallel separations of authentic L-[ u C]fucose, 
and purified [ u C]fucosylphenyl-0-D-galactoside synthesized by 
human plasma H (a-1, 2) fucosyltransf erase. 

Indirect Immunoflu orescence; 

Immunofluorescence was performed on cells plated on 
8-vell tissue culture chamber slides (Lab-Tek) . Cells were 
plated at a density of 5 X 10 V well 24 h prior to analysis. 
Anti-H and anti-A primary antibodies were diluted in PBS 
containing bovine serum albumin at 2 mg/ml, to a final 
concentration of 10 jig/ml. Cell-bound primary antibodies were 
detected with FITC -conjugated goat anti-mouse IgM, diluted to 



40 Mg/ml in PBS containing 2 mg/ml bovine serum albumin. 

Hapten Inhibition - Plated cells were washed twice with 
PBS, and were incubated for 30 min at 4°c with 100 /il of 
diluted anti-H antibody, or with 100 Ml of diluted antibody 
containing different oligosaccharide haptens, each at a 
concentration of 20 mM. The chambers were then washed twice, 
and 100 fil of FITC-conjugated goat anti-mouse IgM was added. 
After 30 min at 4°C, the chambers were washed three times with 
PBS, and the cells were fixed at room temperature for 10 min 
in 3.7% formaldehyde in PBS. The cells were washed twice with 
PBS, the chambers were removed, and the slide was mounted in 
PBS containing 25% glycerol. Cells were examined by 
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fluorescence microscopy using a Zeiss Axiophot photomicroscope 
equipped with fluorescence epi illumination. 

Labeling of Intact Cells with Human Blood Group A (ot- 
l,3)GalNac Transferase - Cells plated in culture slide 
chambers were washed twice with 150 mM NaCl and were then 
fixed for 10 min at room temperature in 3.7% formaldehyde in 
150 mM NaCl. The cells were washed three times with 150 mM 
NaCl and were then incubated with 100 Ml of complete 
transferase reaction mixture or a control mixture. The 
complete transferase mixture consisted of 150 mM NaCl, 15 mM 
MnCl 2 , 50 mM sodium cacodylate, pH 6.8, 0.2% bovine serum 
albumin, 1 mM UDP-GalNAc, and 1.14 units of human blood group 
A (a-1, 3)GalNAc transferase. Control mixtures consisted of 
identical components except for the omission of either 
UDP-GalNAc or blood group A (cr-1, 3) GalNAc transferase. 
Incubations were performed at 37 °C and were terminated after 
either 4 or 24 h by washing the chambers twice with PBS. The 
cells were then analyzed by indirect immunofluorescence using 
mouse monoclonal anti-A or anti-H antibodies, as described 
above for the hapten inhibition studies. 

Southern Blotting ; 

Genomic DNA was digested to completion with restriction 
enzymes (8 units//ig DNA, overnight digestion) . The 
restriction fragments were fractionated by electrophoresis 
through 0.6% agarose gels buffered in Tris-borate-EDTA. The 
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DNA fragments were then transferred to nylon membranes 
(Hybond-N, Amersham Corp.), according to the standard Southern 
blotting method* Blots were prehybridized at 39°C for at 
least 2 h in 50% formamide, 5 SSC (1 X SSC is 150 mM NaCI, 15 
mH sodium citrate, pH 7.0, 1 X PE is 50 mM Tris, pH 7.5, 0.1% 
sodium pyrophosphate, 1% sodium dodecyl sulfate, 0.2% 
polyvinylpyrrolidone (H r 40,000), 0.2% Ficoll ^400,000), and 5 
mH EDTA) , and 150 fig/ml denatured salmon sperm DNA. 
Hybridizations were done in the same solution at 39 °C for at 
least 16 h. Blots were washed four times at room temperature 
in 2 X SSC, 0.1% sodium dodecyl sulfate, and then once for 30 
min at 65 °C in 0.75 X SSC, 0.5% sodium dodecyl sulfate. The 
BLUR8 probe consisted of a 300-base pair BamHI segment 
isolated from the BLUR8 plasmid. This fragment was subjected 
to two cycles of gel purification prior to labeling to ensure 
that it was free from contaminating plasmid sequences. Probes 
were labeled with [c^pJdCTP to a specific activity of at least 
5 X 10 8 cpm/fxg using the random priming method. 

Paper Chromatography Descending paper chromatography was 
performed using Whatman No. 40 in ethyl acetate/pyridine/water 
(10:4:3; Solvent A) or using Whatman No. 3 MM in 95% ethanol/1 
M ammonium acetate (7:3; Solvent B) . 14 C-Labelled compounds 
were located by autoradiography of dried chromatogr ams . 
Alternatively, the dried chromatograms were cut into 1 cm 
strips and the radiolabelled compounds were e luted with water. 
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An aliquot of each eluate was mixed with scintillation 
cocktail and radioactivity was determined in a scintillation 
counter . 

Preparati on of Radiolabeled Standards [ 14 C] Fucose- 1-phosphate 
was prepared by enzymatic cleavage of GDP- [ U C] fucose (l nmol) 
with snake venom phosphodiesterase (EC 3.1.4.1, 1 /il, 0.003 
units, Boehringer-Mannheim) in 20 jil of 100 mM Tris-HCI, pH 8 
at 37 °C for 1 h. The reaction was then fractionated by 
descending paper chromatography on Whatman No. 3 MM using 
Solvent B for 20 h in parallel with GDP- [ U C] fucose and 
[ 14 C] fucose. [ 14 C] Fucose- 1 -phosphate (R iucosB ■» 0.45) was then 
eluted from the chromatogram with water and concentrated under 
vacuum. [ U C] Fucosylphenyl-0-D-galactoside was generated from 
GDP- [ U C] fucose (3 MM) and phenyl-/S-D-galactoside (25 mM) by 
the action of ( a 1, 2) fucosyltransf erase activity in human 
serum, using the reaction conditions described below for assay 
of a (If 2) fucosyltransf erase. The products of this reaction 
were fractionated by descending paper chromatography on 
Whatman No. 40 for 4 h in Solvent A. 
[ H C]Fucosylphenyl-0-D-galactoside was identified by 
autoradiography of the dried chromatogram and was then eluted 
from the paper with water and concentrated under vacuum. 
Alternatively, [ u C]fucosylphenyl-/3-D-galactoside was isolated 
from fucosyltransf erase assay mixtures using the Sep-Pak 
procedure described below. 
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Cell Lines and Cell Culture COS-1 cells were obtained from 
the American Type Culture Collection and were grown in 
Dulbecco's modified Eagle's medium containing 10% fetal calf 
serum* 

Protein Determinations Protein concentrations were 
determined by the BCA method (Pierce Chemical Co.) according 
to the manufacturer's instructions. Bovine serum albumin was 
used as the standard. 

Preparation of Cell Extracts Cells were washed with PBS, 
removed from culture dishes with a rubber policeman, and 
pelleted by centrifugation. Cell pellets were resuspended in 
2 volumes of cold 1% Triton X-100 (Surf act amps x-100, Pierce 
Chemical Co.) and sonicated for 15 seconds using a Branson 
sonicator equipped with a micro tip at 50% power. These 
extracts were either assayed immediately, or were stored at 
-20 °C until use. Under these conditions, enzyme activity in 
the frozen extracts was stable for several weeks, but 
deteriorated rapidly on repeated freezethawing. Tissues from 
C3H mice were isolated, minced with a razor blade, suspended 
in 2 volumes of 1% Triton X-100 and sonicated as described 
above, centrifuged at 1500 x gr for 5 min and the supernatants 
were collected. Mouse intestinal mucosa extracts were 
prepared by everting the small intestine onto a thin 
polypropylene rod, scraping the mucosa cells into phosphate 
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buffered saline, and collecting the cells by centrifugation at 
1500 x g. Extracts were then prepared as described above. 

Partial Purificat ion of Mouse Intestinal 
f al . 2 ) Fu cosvltransf erase 

Preliminary experiments indicated that extracts prepared 
from mouse intestinal mucosa contained large amounts of an 
activity that hydro ly zed GDP-fucose in an acceptor-independent 
manner. Since substrate hydrolysis interfered with accurate 
determination of (al , 2) fucosyltransf erase activity, intestinal 
extracts were fractionated by anion exchange chromatography to 
separate GDP-fucose hydrolysis activity from 
(al, 2) fucosyltransf erase activity. All procedures were 
performed at 40 Q C. Two ml of the Triton-solubilized extract 
was made 10 DM in Tris-HCl, pH 7.6, and treated for 5 min with 
2 ml (bed volume) of DEAE-cellulose (DE52, Whatman) previously 
equilibrated with 10 mM Tris-HCl, pH 7.6. The enzyme solution 
was filtered, made 10 mM in sodium phosphate buffer and the pH 
was adjusted to 6.1. Approximately 47% of the enzyme activity 
present in the initial extracts was recovered after the 
DEAE-cellulose fractionation procedure. A second treatment 
with DEAE-cellulose resulted in substantial loss of enzyme 
activity with no significant reduction in GDP-fucose 
hydrolysis activity (data not shown) . When used under 
standard fucosyltransf erase assay conditions, but in the 
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absence of added acceptor, this preparation hydro lyzed less 
than 2% of the GDP-fucose initially present in the reaction. 

ftwp^jii^ fi^iiflp te Fractionation of Serum Serum was prepared 
from freshly drawn blood obtained from a non-Secretor 
individual. The blood was clotted in a glass tube at 37 °C for 
1 h and was immediately fractionated by ammonium sulfate 
precipitation exactly as described. The 20%-40% ammonium 
sulfate fraction was dialyzed against 2 changes of 4 liters of 
water (8 h each) at 4°C. Assay of (al, 2) fucosyltransf erase 
was done immediately. Alternatively, the fractionated serum 

was aliquoted and stored at -20°C until use. 

i 

Ion Exchange Chromatography of Human Milk 
( al , 2 ) Fuc osvltransf erase 

(al,2) Fucosyltransf erase was isolated from human milk by 
published procedures. Briefly, 300 ml of milk from a 
Se-positive donor was defatted by centrifugation and was 
extensively dialyzed against 20 mM sodium cacodylate, pH 6.0. 
This was applied to a column (2.6 x 115 cm) of 
sulfopropyl-Sephadex equilibrated in 20 mM cacodylate, pH 6.0. 
The column was then washed with 1 liter of 20 mM cacodylate, 
pH 6.0, and was e luted with a linear gradient made from 1.5 
liter each of 20 mM cacodylate, pH 6.0, and 500 mM NaCl in 20 
mM cacodylate, pH 6.0. Fractions (13 ml) were collected and 
assayed for (al, 2) fucosyltransf erase activity using 
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phenyl-0-D-galactoside acceptor as described below. Fractions 
130-144 contained (al,2) fucosyltransf erase activity. These 
were combined, concentrated to 1 ml by ultrafiltration in an 
Amicon stirred cell fitted with a YM5 membrane (MW cutoff = 
5,000), and were then equilibrated against cold deionized 
water. The concentrated enzyme was aliquoted and stored at 
-80°C. 

Synthesis and Characterization of GDP-fl-L-fucose and 
GDP-tt-L-fucose 

GDP-0-L-fucose was synthesized and purified by 
modifications of the method of Nunez et al. (Nunez et al, can. 
J. Chem. , 59, 2086-2095, 1981). The modified method described 
here eliminates the need to separate the anomeric 
1-fucopyranosyl phosphates by differential crystallization 
prior to subsequent synthetic steps. Separation of anomeric 
products is effected by HPLC subsequent to the last synthetic 
procedure. 

Pyridine and tetrahydrofuran (Aldrich) used in the 
synthesis were boiled under reflux over calcium hydride, 
distilled and stored over 4 k molecular sieves. All 
evaporations were performed under reduced pressure on a rotary 
evaporator, with a bath temperature below 35 °C. L-Fucose was 
acetylated exactly as described in Nunez et al ( Can. J. Chem. . 
59 , 2086-2095, 1981). The resulting mixture of 
2, 3,4-tri-O-acetyl-a- and j9-L-fucopyranoses (2.7 g, 9.27 mmol) 
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was phosphorylated using o-phenylene phosphochlor idat e . The 
crude reaction product containing a mixture of anomeric 
1-fucopyranosyl phosphates was fractionated on a Dowex-1 
column (HCOj", 20-50 mesh, 1.5 x 23 cm) preequilibrated in 
water. After application of the anomeric mixture, the column 
was washed with water (500 ml) and eluted with 400 mM 
triethylammonium bicarbonate buffer, pH 7.5 (250 ml). The 
triethylammonium bicarbonate eluant was evaporated to a thick 
syrup under reduced pressure. The syrup was dissolved in 
water and evaporated to dryness. This partially purified 
anomeric mixture of 1-fucopyranosyl phosphates (bis- 
triethylammonium salt, crude yield 70%) was used for the 
synthesis of GDP-0-L-fucose. The anomeric 1-fucopyranosyl 
phosphates (200 mg) were first repeatedly dissolved in dry 
pyridine and then evaporated to dryness in vacuo. Guanosine 
5'-phosphomorpholidate (400 mg) was then added to the dried 
anomeric 1-fucopyranosyl phosphates. This mixture was 
subjected to repeated resuspension in dry pyridine and 
evaporation to dryness in vacuo. The reaction mixture was 
then suspended in dry pyridine (15 ml) and was incubated at 
room temperature. The formation of GDP-/J-L-fucose was 
monitored daily by high performance liquid chromatography 
(HPLC) on a weak anion exchange column (AX 300, 4.6 mm x 22 
cm, Pierce Chemical Co.). An aliquot (10 pi) from the 
reaction mixture was evaporated to dryness, dissolved in water 
and mixed with GDP-[ u C]fucose (2000 cpm) . The sample was then 
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applied to the HPLC column equilibrated in water, and was 
eluted with a linear gradient from 100% water to 250 mM 
triethylammonium acetate, pH 7.0, in 60 min at a flow rate of 
2 ml /min. The eluant was monitored at 268 run, and 0.5 ml 
fractions were collected for scintillation counting to 
identify the co- injected 6DP-[ U C] fucose. Three peaks 
absorbing at 268 nm eluted at approximately 180 mM 
triethylammonium acetate. The retention time of the first 
peak (38.5 min) was identical to the retention time of 
GDP-oc-L-fucose (see below). The second peak eluted at 40.6 
min, and coeluted with the GDP-/J-L-[ W C] fucose standard. A 
third small peak eluting at 42.4 min was not identified. The 
reaction was judged to be essentially complete after 5 days. 
The proportion of a to $ anomer in the final reaction mixture 
was found to be approximately 2:3. The reaction was then 
evaporated to dryness and dissolved in water. GDP-/S-L- fucose 
was purified from this solution on a preparative Hydropore AX 
column (21.4 mm x 25 cm, Rainin Instruments Co.). Aliquots of 
the aqueous solution were applied to the column equilibrated 
in water, and the sample was eluted with a linear gradient of 
100% water to 200 mM triethylammonium acetate, pH 7.0, in 45 
min, at a flow rate of 10 ml/min. The eluant was monitored at 
268 nm. GDP-ce-L-fucose eluted at 36.2 min. GDP-jS-L-f ucose 
eluted at 38.5 min. The GDP-jS-L-f ucose peak was collected and 
evaporated to dryness under reduced pressure. Ammonium 
acetate was removed by repeated co-evaporation with water. 
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The compound that co-eluted with GDP- [ T *C] fucose, and that we 
tentatively identified as GDP-0-L-f ucose , was subjected to 
negative ion fast atom bombardment mass spectrum analysis 
(xenon) , using a VG mass spectrometer (model 7B-250S) • The 
results (m/z [M-H]~ 588) were consistent with this 
identification. The compound was then analyzed by proton 
decoupled 13 C NMR spectroscopy to confirm the anomeric 
configuration of the fucose, and to further establish the 
identity of the compound as GDP-0-L-f ucose . Proton decoupled 
13 C NMR spectra were obtained on a Bruker WM 360, operating at 
909.5 MHz and a sweep width of 221 ppm with 32K data points. 
Probe temperature was 38±1°C. Resonances are reported in ppm 
relative to tetramethylsilane. This analysis yielded the 
following spectral data: 13 C NMR (40 /iM in D 2 0, pH 7.0); 
guanine: 5140.27 (CI and C8) , 156.46 (C2) , 154.38 (C4) , 
118.85 (C5), 161.46 (C6) ; ribosyl moiety: 89.36 (CI), 76.06 
(C2), 73.6 (C3), 86.4 (C4) , 67.8 (C5) ; fucosyl moiety: 100.9 

(CI), 75.09 (C2) f 73.94 (C3), 73.71 (C4) , 73.02 (C5) , 17.35 

(C6) . The distinction between anomeric forms of GDP-L-fucose 
is based upon the chemical shift of anomeric C(l) of the 
fucose ring. The C(l) resonance of the fucose ring in the 
0-anomer is shifted downfield (100.9 ppm) relative to the 
resonance of the anomeric C(l) in the a anomer (98.31 ppm, see 
below) . The values obtained for this compound are essentially 
identical to those reported by Nunez et al. for 
GDP-0-L-f ucose. The resonances attributable to the C atoms of 
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the guanine, ribose and fucose ring are also in agreement with 
the literature values. 

GDP-a-L-fucose was synthesized in a manner similar to 
that described for the preparation of GDP-/3-L-f ucose , except 
that the dicyclohexylammonium salt of 

(a-L-(-) fucose -1 -phosphate (Sigma) was used instead of the 
anomeric mixture of 1-fucopyranosyl phosphates. Analysis and 
purification of GDP-a-L-fucose was performed by HPLC using the 
same conditions described for GDP-0-L-f ucose. The purified 
compound, tentatively identified as GDP-a-L-fucose, was 
subjected to analysis by negative ion fast atom bombardment 
ma6S spectroscopy. The results (m/z [M-H]* 588) were 
consistent with this assignment. The subsequent analysis of 
the compound by 13 C NMR spectroscopy yielded the following 
spectral data: 13 C NMR (50 MM in D 2 0, pH 6.97); guanine: 

5140.27 (CI & C8), 156.46 (C2) , 154.41 (C4) , 118.84 (C5) , 
161.46 (C6); ribosyl moiety: 89.35 (CI), 76.04 (C2) , 71.98 
(C3), 86.4 (C4), 67.8 (C5) ; fucosyl moiety: 98.31 (CI), 70.36 
(C2), 72.99 (C3), 74.3 (C4) , 70.36 (C5) , 17.88 (C6) . The C(l) 

resonance of the fucose ring in this compound (98.31 ppm) is 
consistent with an a anomeric configuration, as reported by 
Nunez et al. The resonances attributable to the C atoms of 
the guanine and the ribose ring are also in agreement with the 
literature values reported for these atoms in GDP-j3-L- fucose. 
GDP-j3-L-fucose was found to be inactive as a substrate for 
(al, 2) fucosyltransf erase. 
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Assay of GDP-L-fucose:fl-D-cralactoside; 2-tt-L-fucosv ltransf erase 

Fucosyltransf erase assays were performed by a 
modification of the procedure reported by Chester et al. 
(Chester et al, Eur. J. Blochem. . 69: 583-593, 1976). The 
standard assay contained GDP- [ U C] fucose (3 j*M) , 
phenyl-0-D-galactoside (25 mM) , ATP (5 mM) and the enzyme 
solution (1-10 /ul) in 20 ^1 of 25 mM sodium phosphate buffer, 
pH 6.1. Based upon preliminary assays, amounts of added 
enzyme activities were adjusted to ensure that reactions were 
linear throughout the period of incubation (4 h for 
fractionated serum, 2 h for each of the other enzyme 
preparations). Under these conditions, less than 15% of the 
substrate was consumed during the incubation period. For the 
determination of pH optima, assays were buffered with 25 mM 
sodium acetate, sodium phosphate, or Tris-HCl, using 
concentrated solutions of these buffers previously adjusted to 
various pH values. The final pH value of each reaction was 
determined with a micro pH probe. In assays to determine the 
apparent Km values for GDP-fucose, GDP- [ 14 C ] fucose was diluted 
with unlabeled GDP-fucose to a final specific activity of 26.3 
mCi/mmol. The concentration of GDP-fucose in this stock 
solution was calculated from the UV absorbance at 254 nm of an 
aliquot diluted in water. The molar extinction coefficient of 
GDP (6=13800 at 254 nm p pH 7.0) was used for this calculation 
since the extinction coefficient of GDP-fucose is not known. 
This stock was then used to yield variable GDP-fucose 
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concentrations (3 - 300 fM) in the assays. The concentration 
of phenyl-0-D-galactoside in these assays was 25 tqM, for all 
< but the milk-derived enzyme. This was assayed in the presence 

of 75 mM phenyl-j3-D-galactoside since the apparent Km 
exhibited by this enzyme for this acceptor is 15.1 mH. In 
assays to determine apparent Km values for 

phenyl-jS-D-galactoside, the concentration of the acceptor was 
varied from 0.5 to 100 mM. 6DP-[ u C]fucose was present in 
these assays at a concentration of 3 pM. The relatively low 
specific activity of commercially available GDP-[ 14 C] fucose, 
and its cost, necessitated the use of a substrate 
concentration well below its K m in these experiments. All 
assays were performed at 37 # C. Assays were terminated after 
an appropriate period of time by addition of 20 pi of ethanol, 
followed by dilution with 1 ml of water. The mixture was then 
centrifuged at 15,000 x g for 5 min. The supernatant 
contained virtually 100% of the radiolabeled product; this 
was collected, and an aliquot was used for separation and 
quantitation of the fucosylated product using the hydrophobic 
interaction chromatography method described below. All assays 
were performed in duplicate or triplicate and included 
parallel incubations done in the absence of acceptor. The 
values obtained in the absence of exogenous acceptor were 
i subtracted to correct for the presence of endogenous acceptor 

molecules. This background acceptor "activity" was always 
less than 3% of the radioactivity incorporated into the 
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phenyl-0-D-galactoside acceptor. Apparent Michaelis constants 
were derived from Lineweaver -Burke plots of acceptor 
concentration-rate determinations. Intercepts were calculated 
by the least squares estimation method. 

A separation procedure based upon hydrophobic interaction 
chromatography was developed for rapid processing of large 
numbers of samples. This procedure effects separation of 
[ u C]fucosylphenyl-0-D-galactoside product from GDP-[ K C]fucose 
on disposable C-18 Sep-Pak cartridges (Waters-Hillipore) • The 
bottom of the Sep-Pak cartridge is mounted in one hole of a 
two-holed rubber stopper, and a 5 ml syringe is attached to 
the top of the cartridge. The other hole in the stopper is 
attached to a vacuum source (approximately 350 mm of Hg) . The 
mounted Sep-Pak cartridge is prepared by washing with 5 ml of 
acetonitrile, followed by 5 ml of water. This is done by 
separately pipetting each wash solution into the attached 
syringe barrel and drawing it through the cartridge under the 
vacuum created when the rubber stopper (with attached vacuum 
line) is pressed onto the top of a 20 ml plastic scintillation 
vial. The contents of the syringe are aspirated through the 
cartridge into the vial. The fucosyltransf erase reaction 
sample is then loaded into the syringe and aspirated through 
the cartridge into a fresh scintillation vial. The 
[ u C]fucosylphenyl-0-D-galactoside is retained on the Sep-Pak 
cartridge. The cartridge is then washed sequentially by 
aspiration with three 2 ml portions of water, collecting each 
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wash into a new scintillation vial. The 

[ 14 C]fucosylphenyl-0-D-galactoside product is then eluted by 
aspiration with three 2 ml portions of 50% acetonitrile. 
Scintillation cocktail (10 ml, Biosafe II, Research Products 
International) is added to each vial to obtain a clear 
solution and the radioactivity in each is determined by 
scintillation counting. Radioactivity eluting in fractions 
1-4 represents GDP-[ u C]fucose, fucose-1 -phosphate, and 
[ u C]fucose present either as a contaminant in the substrate as 
obtained from the manufacturer (approximately 1%) , or that is 
formed by substrate or product hydrolysis. Fractions 5-7 
represent [ H C]fucosylphenyl-0-D-galactoside. The recovery of 
radioactivity with this procedure exceeds 97%. Reconstruction 
experiments showed that recovery of pure 
[ u C]fucosylphenyl-j3-D-galactoside is essentially 100%. 
Separation and recovery are independent of pH and enzyme 
source, and are tolerant of detergents (Triton X-100 and 
Lubrol-PX) at concentrations up to 2%. Product separation by 
this method requires approximately one minute per sample. 
Sep-Pak cartridges have been reused indefinitely with no 
deterioration in performance. 

The Sep-Pak method does not separate 6DP-[ H C]fucose from 
[ 14 C]fucose-l-phosphate and thus cannot detect nucleotide 
pyrophosphatase activity that may consume GDP-fucose while 
generating fucose-l-phosphate and/ or fucose. ATP (5 mM) was 
therefore included in all assays to inhibit nucleotide 
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pyrophosphatase activity. Effective inhibition of 
pyrophosphatase activity was confirmed by descending paper 
chromatography analysis of mock fucosyltransferase assays 
using each enzyme preparation. Each enzyme source was 
incubated for 1 to 4 h at 37 °C in 20 pi of standard assay 
cocktail, but in the absence of phenyl-0-D-galactoside. After 
the reactions were terminated, an aliquot of each was spotted 
on Whatman No. 3 MM, fractionated by descending paper 
chromatography for 20 h in solvent B, and radioactivity was 
determined as described above. Authentic GDP- [ 14 C]f ucose, 
[ 14 C]fucose-l-phosphate f and L-[ u C]fucose standards were 
chromatographed in parallel. This system separates 
GDP-f ucose, f ucose- 1 -phosphate, and f ucose. It therefore 
allows quantitation of the amount of GDP- [ u C]f ucose remaining 
at the end of the incubation, and thus provides an estimate of 
pyrophosphatase activity. Under the standard assay 
conditions, more than 97% of the GDP- [ 1i C]f ucose initially 
present in each mock reaction remained unhydrolyzed and 
available for transglycosylation. 

Each enzyme preparation was also tested for a-fucosidase 
activity that could hydrolyze the 

[ u C]fucosylphenyl-0-D-galactoside product and prevent accurate 
determination of (ccl, 2) fucosyltransferase activity. Enzyme 
preparations were incubated with purified [ 14 C]fucosylphenyl-0- 
D-galactoside (7000 cpm, 10 pmol) in fucosyltransferase assay 
buffer, but in the absence of added GDP- [ u C]f ucose or phenyl- 
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jS-D-galactoside. These reactions were incubated at 37°C for 
various times, and were then fractionated by the Sep-Pak 
method to determine the amount of remaining [ 14 C]fucosylphenyl- 
0-D-galactoside. Fractionated human serum and mouse 
intestinal mucosa extracts each contained significant 
ot-fucosidase activity. However, inclusion of 10 mM L-fucose 
in the reaction cocktail effectively inhibited fucosidase 
activity in these enzyme preparations. Therefore 10 mM 
L-fucose was included when assaying these enzyme sources. 
Significant amounts of a-fucosidase activity were not detected 
in any of the other cell extracts; product hydrolysis never 
exceeded 1%/h under the standard assay conditions. 
Reconstruction experiments with these extracts showed that 
L-fucose at a concentration of 10 mM does not alter the 
activity of (al , 2 ) f ucosyltransf erase • 

In aggregate, these experiments showed that, under the 
conditions used to assay fractionated human serum, human milk, 
and mouse intestinal mucosa, or crude cell extracts, the 
quantity of [ u C]fucosylphenyl-£-D-galactoside product 
determined reflected true enzymatic activity, because both 
product hydrolysis and acceptor- independent hydrolysis of the 
GDP-fucose substrate were negligible. 

Digestion of r H C1FucosvlPhenvl-fl-D-aalactoside with 
g-Fucosidase 
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[ u C]Fucosylphenyl-0-D-galactoside (approximately 10,000 
cpm) was digested with 0.025 units of bovine kidney 
a-L-fucosidase (EC 3.2.1.51, Sigma) in a final volume of 20 /il 
containing 5 mM sodium citrate pH 6.0, at 37°C for 1 h. This 
mixture was then fractionated by descending paper 
chromatography on Whatman No. 40 in Solvent A for 4 h. The 
products of the digestion were identified by comparison to 
parallel separations of L- [ 14 C ] f ucose and [ u C]fucosylphenyl-0- 
D-galactoside standards. 




cments fr om Transfer 



Clone S2-2 

High molecular weight genomic DNA was isolated from the 
H- expressing secondary transfectant s2-2, digested to 
completion with EcoRI, and fractionated through a 1% agarose 
gel buffered in tris-borate-EDTA. The region of the gel 
containing the 2.7 kb and the 3.4 kb human EcoRl fragments was 
divided into 3 mm slices and the DNA in these was isolated by 
electroelution . Aliquots of the size-fractionated DNA were 
analyzed by southern blotting with a radiolabeled Alu probe 
(BLUR8), using hybridization and wash conditions described 
above. Fractions containing either the 2.7 kb or the 3.4 kb 
fragment were used separately to prepare phage libraries in 
lambda gtll. These libraries were screened with a 
radiolabeled BLUR8 probe. Positive phages isolated from a 
tertiary screen were used to prepared phage DNA, and phages 
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containing either the 2.7 kb EcoRl fragment or the 3.4 kb 
EcoRl fragment were identified by Southern blotting. The 3.4 
kb or the 2.7 kb inserts were released from the phage arms by 
EcoRI digestion, purified by agarose gel electrophoresis and 
electroelution, and individually subcloned between the EcoRI 
sites in pWE15. 

COS-1 cell Transfection Plasmid DNAs were transf ected into 
COS-1 cells by the DEAE-dextran procedure. Seventy two hours 
after transfection, calls were harvested, extracts were 
prepared as described above, and extracts were subjected to 
assays for (al,2) fucosyl transf erase activity, for GDP-fucose 
hydrolysis activity, and for a-fucosidase activity. 

Isolation of Human a (1.2) FT cDNA clones . 1.8 X 10 6 recombinant 
clones from an A431 cell cDNA mammalian expression library 
were screened by colony hybridization, using a 32 P- labeled 1.2 
kb Hlnfl fragment of pH3.4 as a probe. Filters were 
hybridized for 18 hours at 42 °C in a hybridization solution 
described above, washed, and subjected to autoradiography. 
Two hybridization-positive colonies were obtained and isolated 
via two additional rounds of hybridization and colony 
purification. Preliminary sequence analysis of the inserts in 
both hybridization-positive cDNA clones indicated that they 
each were in the anti-sense orientation with respect to the 
pCDM7 expression vector promoter sequences. The largest 
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insert was therefore re-cloned into pCDM7 in the sense 
orientation for expression studies, and the resulting plasmid 
was designated pCDM7-a(l,2)FT. 

Flow cytometry Analysis . COS-1 cells were transfected with 
plasmid DNAs using the DEAE-dextran procedure described above. 
Transfected cells were harvested after a 72 hour expression 
period and stained either with mouse IgM anti-H monoclonal 
antibody (Chembiomed; 10 /ig/ml) or with a mouse IgM 
anti-Lewisa monoclonal antibody (Chembiomed; 10 nq/ml) . Cells 
were then stained with f luorescein-conjugated goat anti-mouse 
IgM antibody (Sigma; 40 ng/ml) and subjected to analysis by 
flow cytometry. 

Northern and Southern Blotting A431 poly (A) -plus RNA (10 
jig/lane) was subjected to Northern blot analysis. Genomic DNA 
(10 ixq/lane) was subjected to Southern blot analysis. Blots 
were probed with ^P-labeled 1.2 kb Hinfl fragment of pH3.4. 

DNA Sequence Analysis . The insert in pCDM7-a(l, 2) FT was 
sequenced by the method of Sanger using T7 DNA polymerase 
(Pharmacia) and 20-mer oligonucleotide primers synthesized 
according to the sequence of the cDNA insert. Sequence 
analyses and data base searches were performed using the 
Microgenie Package (Beckman) and the Sequence Analysis 
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Software Package of the University of Wisconsin Genetics 
Computer Group. 



Assay of g(1.21fuc oBvltransferase Activity, Cell extracts, 
conditioned medium from transfected COS-l cells, and 
IgG-Sepharose-bound enzyme were prepared and assayed for 
a (1,2) fucosyltransf erase activity by the methods described 
above. One unit of a (1,2) fucosyltransf erase activity is 
defined as 1 pmol product formed per hour. The apparent 
Michaelis constant for the acceptor phenyl-0-D-galactoside was 
determined exactly as described above. 

construction and Ana lysis of a Protein A-gfl.2)FT Fupign 
Vector . A 3196 bp Stul/Xhol segment of the cDNA insert 
containing the putative catalytic domain and 3 '-untranslated 
sequences was isolated from pCDM7-a(l,2)FT. This fragment was 
blunt-ended using Klenow enzyme and ligated to phosphorylated 
and annealed oligonucleotides (CGGAATTCCCCACATGGCCTAGGT, 
CCTAGGCCATGTGGGGAATTCCG) designed to reconstruct the coding 

A 

sequence between the putative transmembrane segment proximal 
to the StuI site, corresponding to amino acids 33 through 365 
of SEQ ID NO: 6. The ligated fragment was gel purified, 
digested with ZcoRI and then gel purified again. This EcoXl 
linkered fragment was ligated into the unique EcoRl site of 
pPROTA. One plasmid, designated pPROTA-a (1,2) FT C , containing a 
single insert in the correct orientation, was analyzed by DNA 
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sequencing to confirm the sequence across the vector, linker 
and insert junctions. Plasmids pPRPTA~a ( 1 , 2 ) FTC , pPROTA, 
pCDM7-a(l,2)FT, or pCDM7 were transfected into COS-1 cells. 
Following a 72 hour expression period, a (1,2) FT activities in 
the media, associated with cells, bound to a Sepharose IgG 
matrix, or to a control Sepharose matrix, were quant ita ted. 



Example II. cloning a nd expression of a DNA Sequence 

Encoding a UDP-Gal;fl-D- Gal ( 1. 4 1-D-GlcNAC 
flfl.3\aalactosvltrans f erase f DNA SEP ID NO:3, 

Protein SEP ID NO: 4) 
A Gene Transfer Approach to Isolate Cloned, Functional, 
0-D-galactosy 1-1 , 4 -N-acety 1-D-glucosaminide at- 
1,3-galactosyltransferase cDNAs. Tissue- and cell-specific 
expression of surf ace- localized terminal Gal(otl-3)Gal linkages 
is associated with expression of cognate (al-3)GTs that 
catalyze a transglycosylation reaction between UDP-Gal and N- 
acetyllactosamine, COS-1 cells construct surface-expressed 
polylactosamine molecules that can function as an acceptor 
substrate for (al-3)GT but do not express this enzyme or its 
surface-localized product (see below) . The inventor therefore 
expected that cloned cDNAs encoding an <al-3)GT would, if 
expressed in COS-1 cells, generate the surface-localized 
oligosaccharide product of that enzyme [terminal Gal(al-3)Gal 
linkages]. Moreover, these particular transf ectants could be 
isolated by virtue of adherence to plates coated with a lectin 
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(GS I-B 4 ) that specifically binds terminal Gal (crl-3) Gal 



linkages* A standard transient expression system was used for 
this approach since it provides for the rescue of transfected 
cDNAs that determine the expression of cell surface molecules 
on COS-1 cells and allows the facile construction of large 
cDNA libraries in a mammalian expression vector. 

Isolation of a Cloned cDNA That Determines Expression of 
GS I-B 4 Binding Activity in Transfected COS-1 Cells. Mouse F9 
teratocarcinoma cells express an (al-3)GT, and this enzyme 
activity increases concomitant with retinoic acid-induced 
differentiation of these cells. The inventor therefore 



prepared a cDNA expression library from retinoic 
acid-differentiated F9 cells and screened this library for 
cDNAs that determine expression of GS I-B A binding activity in 
transfected COS-1 cells. One plasmid (pCDM7-aGT) was isolated 
that, when transfected into COS-1 cells, determined expression 
of surface molecules that directed specific adherence of cells 
to culture dishes coated with GS I-B 4 . Fluorescence-activated 
cell sorting analysis confirmed these observations. COS-1 
cells transfected with pCDM7-ocGT, but not cells transfected 
with pCDM7, stained brightly with fluorescein isothiocyanate- 
conjugated GS I-B 4 . This staining could be inhibited with 
raffinose, a hapten for this lectin. These observations 
indicate that pCDM7-aGT determines de novo expression of 
surface-localized molecules recognized by GS I-B 4 and thus 
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expression of terminal Gal(al-3)Gal linkages on cell surface 

oligosaccharides . 

cDNA Sequence Analysis Predicts a Protein with a 
Transmembrane Topology* The cDNA insert in pCDM7-aGT (SEQ ID 
NO: 3), Fig. 2, is 1500 base pairs long and contains a single 
long open reading frame in the sense orientation with respect 
to pCDM7 promoter sequences. Three methionine codons are 
found within the first 15 codons of this reading frame; the 
inventor assigned the most proximal of these as the initiator 
codon, based on Kozak's rules for mammalian translation 
initiation. This reading frame predicts a protein of 394 
amino acids in length (SEQ ID NO: 4), Fig. 2, with a molecular 
mass of 46,472 Da. Hydropathy analysis indicates that this 
protein has features of a type II transmembrane molecule that 
is topologically identical to that predicted for two other 
mammalian glycosyltransf erases. This topology predicts a 41- 
amino-acid, cytoplasmically oriented, NH 2 -terminal segment; a 
single transmembrane domain consisting of a 19-amino-acid 
hydrophobic segment flanked by basic residues; and a large 
(presumably catalytic) COOH-terminal domain that would 
ultimately be targeted to the lumen of the Golgi. Two 
potential N-glycosylation sites are present, indicating that 
this protein, like other glycosyltransf erases, may be 
synthesized as a glycoprotein. This cDNA sequence contains a 
long 5' untranslated region, with ATG codons at -90 and -251, 
suggesting that trans lational control mechanisms may 
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participate in the regulation of expression of this sequence* 



This is reminiscent of another mammalian glycosyltransferase 
whose transcript also contains upstream ATG codons. The 



characteristic cleavable signal sequence that may exist in one 
form of a murine 0-1,4-galactosyltransf erase. 

Searches of the currently available protein and nucleic 
acid data bases (Protein Identification Resource, Release 21.0 
and GenBank, Release 60.0) identified no sequences with 
significant similarity to the (ol-3)GT DNA sequence, including 
the sequences of a murine 0-1,4-galactosyltransf erase and a 
fat a-2 , 6-sialyltransf erase . 

Expression of a Catalytically-Active, Secreted Protein 
A-(al-3)GT Fusion Protein. The inventor wished to confirm 
that this cDNA encodes an (al-3)GT and to simultaneously 

* 

exclude the formal possibility that it instead encodes a 
trans-acting molecule that induces (al-3)GT activity by 
interaction with an endogenous gene, transcript, or protein. 
Therefore, sequences corresponding to the putative catalytic 
domain of this protein (residues 63-394 of SEQ ID NO: 4) were 
fused in- frame to a secret able form of the IgG binding domain 
of staphylococcus aureus protein A in the mammalian expression 
vector pPROTA yielding the vector pPROTA-aGT c . This vector was 
then tested for its ability to express a catalytically active, 
secreted and soluble protein A-(al-3)GT fusion protein. 



putative NHj-terminal end of this protein lacks a 
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COS-1 cells transfected with the pCDM7 vector or with the 
pPROTA vector generated no detectable cell-associated or 
released (al-3)GT activity. By contrast, extracts prepared 
from COS-1 cells transfected with pCDM7-aGT or with the 



respectively, of (al-3)GT activity. Moreover, conditioned 
media prepared from cells transfected with pCDM7-otGT or 



4,155 units or 50,438 units, respectively. Importantly, the 



specifically bound to a IgG-Sepharose matrix, whereas the 
released activity generated by pCDM7-oGT did not interact with 



this affinity adsorbent. These results indicate that this 
cloned cDNA encodes an (al-3)GT, show that information 
sufficient to generate a catalytically active (al-3)GT resides 
within the 332 amino acids distal to the putative 
transmembrane segment, and show that the catalytic domain can 
be affinity purified in an enzymatically active state as a 
portion of a bipartite fusion protein. 

Determination of the Structure of the Trisaccharide 
Product of the (al-3)GT. Exoglycosidase digestion was used to 
confirm the a-anomeric linkage predicted for the 
oligosaccharide product generated by the recombinant enzyme. 
Radiolabeled trisaccharide product was prepared from 
UDP-[ H C]Gal and N-acetyllactosamine by using the 
IgG-Sepharose-bound enzyme activity generated by pPROTA-oGT c . 



pPROTA-oGT c vector contained 4574 and 20,500 total units, 



pPROTA-aGT c contained soluble (al-3)GT activity amounting to 



released activity generated by pPROTA-aGT c could be 
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Digestion of the HPLC-purif ied trisaccharide product with a 
galactosidase resulted in quantitative release of [ u C]Gal, 
whereas the trisaccharide was completely resistant to 0- 
galactosidase digestion. 



N-acetyllactosamine acceptor is involved in the glycosidic 
linkage formed by the recombinant enzyme, the inventor 



incubated it with IgG-Sepharose-bound protein A-(oel-3)GT 
activity and 1 mM UDP-Gal under the standard (ai-3)GT reaction 
conditions* The trisaccharide product of this reaction was 
purified and subjected to methylation analysis. Radioactive 



2,4, 6-trimethylgalactose was identified. Together, these 
results indicate that the recombinant enzyme can utilize 
UDP-Gal and N-acetyllactosamine as substrates to construct a 
trisaccharide product with the structure Gal (al-3) Gal (01-4)- 
GlcNAc . 

Northern Blot Analysis. The (al-3)GT cDNA hybridizes to 
a single 3.6-kilobase transcript in F9 teratocarcinoma cells. 
The inventor's DNA sequence analysis of another cloned 
(otl-3)GT cDNA isolated by colony hybridization indicates that 
the insert in pCDH7-aGT represents the 5' end of this 
transcript. The remaining 2.1 kilobases of this transcript 
consist of 3' untranslated sequence not rescued by the 
expression cloning procedure. 



To confirm that carbon 3 of the galactose in the 



prepared a [*H] Gal- labeled N-acetyllactosamine acceptor and 
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The specific activity of (otl-3)GT in retinoic 
acid-differentiated F9 teratocarcinoma cells is approximately 
4-fold higher than that in untreated F9 cells. Northern blot 
analysis indicates that steady-state levels of the (al-3)GT 
transcript also increase concomitant with retinoic 
acid-induced differentiation of F9 teratocarcinoma cells. 
These results are similar to those reported in F9 cells with 
0-1, 4-galactosyltransf erase and suggest that the dynamic 
changes in cell surface oligosaccharide structures known to 
accompany in vitro d ifferentiation of this cell line are 
associated with significant changes in glycosyl transferase 

gene expression. 

Experimental Procedures fo r Example II. "Cloning and 
egression of a DNA sequence encoding a UDP-Gal:fl-P- 
Gal f 1 . 4 ) -D-GlcNAc a ( 1 . 3 ) qalactosvltransf erase 1 * . 
Construction of an F9 Cell cDNA Library . A cDNA library 
was prepared from poly (A)* UNA isolated from retinoic acid- 
differentiated mouse F9 teratocarcinoma cells by using the 
procedure of Seed and Arruffo, and the mammalian expression 
vector pCDM7. pCDM7 is a progenitor of the vector pCDM8; 
pCDM7 lacks the polyoma sequences present in pCDM8, but is 
otherwise virtually identical. The library contained 3 x 10* 
independent recombinants . 

Isolation of a Mouse (otl-3)GT cDNA Clone. Plasmid DNA 
was prepared from an amplified portion of the library and was 
trans fee ted in to COS-1 cells by using the DEAE-dextran 
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procedure. Forty samples of 5 x 10 5 COS-1 cells (in 100-mm 
dishes) were transfected with 50 fig of plasmid DNA each. 
After a 72-hr expression period, the transfected COS-l cell 
monolayers were harvested and panned on dishes coated with 
Griffonia simolicif olia isolectin I B 4 (GS I-B A ) . Lectin 
panning dishes were prepared by using 10 M9 of GS I-B 4 per ml 
in phosphate-buffered saline (PBS) containing 0.1 mM Ca 2+ and 
0.1 mM Mn 2+ . Plasmid DNA molecules were rescued from adherent 
cells and were transformed into the Escherichia coli host 
MC1061/P3. Plasmid DNA was prepared from these transf ormants 
and was subjected to an additional screening by the same 
procedure. Sib selection was subsequently used to screen for 
plasmids that determined expression of GS I-B 4 binding activity 
in COS-1 cells. E. coli transf ormants containing plasmid 
molecules rescued from the second screening were plated to 
yield 16 pools containing between 100 and 5000 colonies each. 
Plasmid DNAs were prepared from replica plates and were 
transfected separately into COS-1 cells, and the transf ectants 
were screened by panning on GS I-B^-coated dishes. These 
experiments indicated that approximately 1 out of 1000 
colonies contained cloned cDNAs determining the GS I-B^-binding 
phenotype. One "active" - 1000-colony pool was subdivided 
into several smaller pools, and these were each tested for GS 
I-B 4 -binding activity. Three subsequent rounds of sib 
selection with sequentially smaller, active pools identified a 
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eingle plasmid (pCDM7-a6T) that directed expression of GS 
I-B 4 -binding activity in COS-l cells. 

Flow Cytometry . COS-l cells transfected with plasmid 
DNAs were harvested 48-72 -hr after transfection. These were 
stained with either fluorescein isothiocyanate-conjugated GS 
I-B A at 10 fig per ml in staining media or with fluorescein 
isothiocyanate- conjugated GS I-B 4 that had been previously 
incubated with 50 mM raffinose. Cells were then subjected to 
analysis by fluorescence- activated cell sorting as described 
above . 

northern B lotting and DMA Sequence Analysis. Northern 
blots were hybridized with radiolabeled pCDM7-aGT cDMA insert 
at 42 °C in hybridization solution. DNA sequencing was 
performed with the chain termination method by using 
oligodeoxynucleotides synthesized according to the sequence 
within the cDNA insert. Sequence data base searches and 
analyses were performed with the Sequence Analysis Software 
Package published by the University of Wisconsin Genetics 

Computer Group. 

Aesav of ffti-31GT and Produc t Characterization. Extracts 

were prepared from transfected COS-l cells. Cell extracts, 

conditioned medium from transfected cells, or 

IgG-Sepharose-bound enzyme was assayed for (al-3)GT. One unit 
of (al-3)GT activity is defined as 1 pmol of Gal transferred 
to N-acetyllactosamine acceptor per hour. 
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HPLC-purif ied, radiolabeled oligosaccharide reaction 
products were subjected to digestion with either 
a-galactosidase (Sigma, 20 mU) or 0-galactosidase (Sigma, l 
mU) for 1 hr at 37 °C in buffers recommended by the 
manufacturer. Reaction products were then fractionated by 
HPLC. Methylation analysis of reaction product (s) was carried 
out according to standard procedures. 

ro yistruction an d Analysis of the Protein A-(ttl-3)GT 
Fusion Vector . A 1050-base-pair segment of the (al-3)GT cDMA 
containing the putative catalytic domain was excised from 
pCDM7-aGT by digestion with EcoR I. This was cloned into the 
EcoR I site of pPROTA by using a double-stranded linker 
( 5 ' -ACGGAATTCCGT-3 ' ) to maintain the correct reading frame, 
yielding plasmid pPROTA-oGT c . 

Plasmids pPROTA-aGT c , pCDM7-aGT, and pPROTA were 
separately transfected into COS-l cells. After a 72-hr 
expression period, the media were harvested and subjected to 
low-speed (300 x g for 8 min) and high-speed (100,000 x 3 for 
1 hr) centrifugations. Supernatants were then adjusted to 
0.05% Tween 20 and were incubated batchwise with 100 ja1 of 
preequilibrated IgG-Sepharose or Sepharose 6B overnight at 
4«>C. The matrices were then thoroughly washed and used 
directly in (al-3)GT assays. 
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Example III. Isolation of a cl oned human cDNA that encodes a 



In one embodiment, the present invention provides a gene 
transfer system that allows isolating cloned cDNAs encoding 
functional a (1, 3) fucosyl transferase [at(l,3)FT] or 
a ( 1 , 4 ) f ucosyltransf erase [a (1,4) FT) molecules or that 
otherwise determine a(l,3)FT or a(l,4)FT expression, without 
the need to first purify the enzyme* This system instead 
exploits existing reagents that allow detection of the cell 
surface-expressed oligosaccharide product of these" enzymes, 
and that provide for specific assay of their enzymatic 
activity. 

This approach requires a recipient host cell with 
specific properties that allow selection of the appropriate 
cloned cDNA molecules. The host must not express a(l,3)FTs r 
nor cognate surface Gal 0(1,4) [Fuca(l,3) ]GlcNAc linkages 
(SSEA-1 structures) . However, this host must synthesize the 
appropriate substrates for surface display of SSEA-1 
molecules. These substrates include the nucleotide sugar 
GDP-fucose, and surf ace-expressed glycocon jugate molecules 
that may serve as oligosaccharide acceptors for the 
transglycosylation reaction. Each of these properties are 
fulfilled by COS-1 cells. 



GDP-Fuc : fl-D-Gal ( 1 . 4 / 1 . 3 \ -D-GlcNac f /GIO - 
a f 1. 3 / 1. 4 >-f ucosvl transf erase. fDNA SEP ID 



WOil, Protein SEP ID NO; 2) 
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Fluorescence-activated cell sorter (FACS) analysis 



indicated that COS-1 cells do not express surface-localized 
SSEA-1 determinants. Enzyme analyses performed with COS-1 
extracts confirmed that absence of SSEA-1 expression was due 
to a deficiency of a (1,3) FT activity. The inventor expected 
that COS-1 cells would contain substrate levels of GDP-f ucose 
within their Golgi, since with the exception of certain 
lectin-resistant mutant cells lines, virtually all mammalian 
cells synthesize GDP-f ucose and translocate it into the Golgi 

lumen. COS-1 cells also construct surface-expressed 
glycoconjugates containing unsubstituted polylactosamine 



moieties that represent potential oligosaccharide acceptors 
for a (1,3) FT activity determined by a transfected cDNA. The 
inventor confirmed that these substrates are available for use 
in the construction of surface-expressed, terminal f ucose 
linkages by demonstrating expression of a different 
terminally-linked f ucose linkage (H Fuccr (1, 2)Gal) on COS-1 
cells after transfection with a cloned human gene fragment 
that the inventor had previously shown to determine expression 
of an a (1,2) FT. The inventor therefore observed that COS-1 
cells could construct surface-expressed SSEA-1 molecules 
following transfection with a ( 1 , 3 ) FT-determining cDNAs. 

Isolation of a Cloned cDNA That Determines Express ion of an 
an. 31 FT and Surf ace-Localized SSEA-1 Structures 



9 
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The human A431 cell line has been shown to express cell 
surface Lewis blood group a and b structures that represent 
the products of an a (1,4) FT. Enzyme assays performed with 
A431 extracts confirmed that cells also express a 
corresponding a (1,3) FT activity. A cDNA library was therefore 
constructed with A431 cell mRNA in the mammalian expression 
vector pCDM7 and was transfected into COS-1 cells. The 
transfected cells were screened by the procedure of Seed using 
a monoclonal antibody specific for SSEA-1 determinants. 

In order to follow enrichment for an a ( 1 , 3 ) FT-determining 
cDNA during the selection procedure, an assay was employed in 
which 2'-fucosyllactose was used as an acceptor substrate. 
This acceptor can discriminate between the Lewis ce (1,3/1,4) FT 
and nonallelic human a(l,3)FTs, since it is used efficiently 
by the former enzyme but not by the latter. With this assay, 
the inventor was unable to detect any enzyme activity in COS-1 
cells transfected with the A431 cDNA library, or in cells 
transfected with amplified plasmid DNA isolated from the 
initial selection. However, amplified plasmid DNA obtained 
from the second selection was found to direct a low level of 
enzyme activity when transfected into COS-1 cells. A modest 
increment in enzyme activity was obtained after a third 
selection by panning. At this stage, "sib selection" was 
employed to identify and isolate a cloned o (1,3) FT cDNA. 
Pools of clones isolated from the third panning selection were 
tested for their ability to generate o (1,3) FT activity in 
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transfected COS-1 cells. From these experiments, it was 
estimated that approximately one in 500 clones represented a 
plasmid that determined a (1,3) FT expression. One 11 active" 
pool of approximately 400 clones was further subdivided and 
the resulting pools were tested for their ability to generate 
enzyme activity in transfected cells. One clone 
(pCDM7-at(l, 3/1,4) FT) in an active 16 clone pool was found to 
direct very high level expression of the ot(l,3)FT. FACS 
analysis was used to confirm that this plasmid also directs 
surface expression of SSEA-1 (Lewis x) determinants (Fig. 8) . 
COS-1 cells transfected with this plasmid stain brightly with 
ant i- SSEA-1 antibody, but not with a control IgM anti-H 
antibody, whereas cells transfected with pCDH7 vector alone 
exhibit background staining with both antibodies. Identical 
results were obtained in experiments where the transfected 

* 

cells were stained with an anti-Lewis a antibody (Fig. 8) • 

Deduced Protein Sequence of the Cloned cDNA Predicts a 
Transmembrane Topology 

The cDNA insert in pCDM7-ot(l,3/l,4)FT (SEQ ID NO:l) is 
2022 nucleotides in length, and consists of a 72 bp 5 9 
untranslated region, a continuous open reading frame of 1083 
bp, and a 3' untranslated region of 867 bp that is terminated 

by a poly (A) tail. 

This cloned cDNA hybridizes to a single prominent 2.3 kb 
transcript in A431 cells showing that this insert is 
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essentially full-length. The nature of an additional faint 
7.5 kb transcript is at present undefined. The initiation 
codon at the beginning of the long open reading frame is 
embedded within a sequence similar to the Kozak consensus 
initiation sequence and is preceded by two in-frame stop 
codons* There is also a single additional ATG upstream from 
the assigned initiator. This ATG also fits the Kozak 
consensus sequence, but initiates a very short in- frame 
sequence. The long open reading frame predicts a 361 amino 
acid protein (SEQ ID NO: 2) of Mr 42,069 Da. Sequence 
comparisons with the latest DNA and protein sequence databases 
(Protein Identification Resource, Release 21. 0, and GenBank, 
Release 60.0) identified no sequences with significant 
similarity to this sequence, except for a segment within the 
3' untranslated region that is similar to human Alu sequences. 
The 3' untranslated region also contains 20 degenerate copies 
of a 16 nucleotide sequence of unknown functional 
significance. 

Comparisons between the sequence predicted by the insert 
in pCDM7-a(l,3/l,4)FT and four different cloned mammalian 
glycosyltransf erases revealed no obvious primary sequence 
similarities. While these latter enzymes also share no 
extensive primary sequence similarities, they exhibit an 
analogous overall structural organization. Specifically, 
these enzymes are representative of Type II transmembrane 
proteins, each composed of a short, NH 2 -terminal cytoplasmic 
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domain, a single transmembrane segment, and a larger, 
COOH-terminal catalytic domain that ultimately inhabits the 
Golgi lumen. Inspection and hydropathy analysis of the 
predicted protein sequence suggested that this protein 
maintains a similar structural organization. There is a 
single hydrophobic segment near the amino terminus that is 
comprised of 19 amino acids and is flanked by basic residues. 
This putative signal-anchor sequence would place 327 amino 
acids within the Golgi lumen, while leaving 15 residues within 
the cytosolic compartment. 

The Protein Encode d bv pCDM7-ttf 1.3/1. 41FT is a 
Fucosvltransf erase 

Expression data and the predicted topological similarity 
of this sequence to other glycosyltransf erases , show that this 
cDNA encodes an a(l,3)FT. Nonetheless, these data are also 
formally consistent with the possibility that this cDNA 
sequence instead encodes a molecule that trans-determines 
a (1,3) FT activity by interaction with an endogenous gene, 
transcript, or protein. To demonstrate that enzymatic 
activity is directly associated with this molecule, the 
putative catalytic domain of the predicted polypeptide 
(residues 43-361 of SEQ ID NO: 2) was fused to a secreted form 
of the igG binding domain of staphylococcus AttrWP protein A 
in the mammalian expression vector pPROTA, to generate the 
vector pPROTA-a(l,3/l,4-Ft) c . Since this fusion protein would 
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lack the putative transmembrane anchoring segment of the 
fucosyltransf erase, the inventor expected it would be 
synthesized as a secreted molecule that could be 
affinity-purified on an IgG-containing matrix and subsequently 
tested for a (1,3) FT activity. COS-1 cells transfected with 
the control vectors pCDM7 or PROTA generated no detectable 

+ 

cell-associated or released enzyme activity. However, 
conditioned media prepared from GOS-1 cells transfected with 
pPROTA-a(l,3/l,4)FT c or with pCDH7-ot(l,3/l,4)FT, contained 
significant quantities of ot(l,3)FT activity. Significantly, 
virtually 100% of the released activity generated by 
pPROTA-ot(l,3/l,4)FT c was specifically retained by the 
IgG-Sepharose matrix, and approximately 24% of this activity 
could be recovered from the matrix after exhaustive washing. 
By contrast, the released activity generated by 
pCDM7-a(l,3/l,4)FT did not interact with the affinity 
adsorbent. These results indicate that the protein encoded by 
this cloned cDNA encodes a fucosyltransf erase, demonstrate 
that information sufficient to generate a (1,3) FT activity 
resides within the enzyme's COOH-terminal 319 amino acids, and 
show that this approach can be used to affinity purify the 
catalytic domain in an enzymatically active state as a portion 
of a bipartite fusion protein. 
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The Fucosvltransferase is a Glycosylated Transmembrane Protein 

In order to confirm the transmembrane topology predicted 
for the enzyme, fucosyltransf erase cRNA was prepared and was 
subjected to analyses by a series of in vitro translation 
experiments. The ^S-methionine-labelled primary translation 
product generated in these experiments migrated with a 
molecular weight of approximately 37,500 Da. The discrepancy 

between this observed molecular weight and the predicted one 

Da 

\ (42,069 px) may be reconciled by the observation that 
membrane-spanning proteins often migrate in an anomalously 
rapid manner through SDS-polyacrylamide gels, relative to 
soluble protein molecular weight markers. When this 
radiolabeled protein was generated by in vitro translation in 
the presence of canine pancreatic microsomes, it migrated with 
an Mr of approximately 42,000 Da. The - 6,000 Da increase in 
molecular mass observed when the translations were performed 
in the presence of microsomes suggests that two core 
glycosylation structures are added by microsomal 
oligosaccharyltransf erase to the two potential 
asparagine- linked glycosylation sites during cotranslational 
translocation across the microsomal membrane. This product 
also co-sedimented with the microsomes, suggesting that the 
protein had become cotranslational ly inserted within, or 
translocated across, the microsomal membrane. When this 
raiolabelled, microsome-associated protein was subjected to 
limit digestion with endog lycos idase H, its molecular mass was 
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reduced to a one essentially identical to that of the primary 
translation product. Partial endoglycosidase H digestion 
generated an additional band of intermediate size, that likely 
consists of molecules that contain a single residual core 
glycosylation unit. These results indicate that the addition 
of core oligosaccharide structures is responsible for the 
increase in size of the protein observed in the co-translation 
experiments. These observations indicate that the two 
potential N-glycosylation sites found within the predicted 
fucosyltranserase amino acid sequence are glycosylated during 
translocation across the microsomal membrane. 

Additional support for the predicted transmembrane 
topology of the fucosyltransf erase was provided by the results 
of protease protection experiments. Co-translation in the 
presence of microsomes yields a 42,000 Da polypeptide that is 
resistant to digestion with protease. The prot ease-digested 
product migrates slightly faster than the undigested, 
microsome-associated polypeptide; this difference is most 
likely accounted for by removal of some or all of the 15 NH 2 - 
terminal amino acids predicted to be displayed on the exterior 
of the microsomes. Addition of microsomes after translation 
yielded a protease-sensitive, nonglycosylated radiolabeled 
product, indicating that membrane insertion of the protein is 
a cotranslational, but not post-translation, event. A small 
amount of a - 34KDa polypeptide that is protease-sensitive and 
glycosylated can also be identified in these experiments. The 
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precise nature of this protein is unknown, but it appears in a 
proteinase K concentration-dependent manner. The inventor 
therefore suspected that it represents a proteolytic fragment 
of the intact protein generated when the integrity of some 
microsomal vesicles is disrupted . In aggregate, these 
experiments indicate that the bulk of this polypeptide can be 
sequestered within the microsomal lumen by a cotranslational 
translocation process, ultimately yielding a product that is 
N-glycosylated. These results are consistent with the type II 
transmembrane 

topology predicted by the fucosyltransf erase sequence. 

Tlie Fucosvltransfer ase Can Construct Two Distinct giycosj<Uc 
fclnftatqes 

It is believed that, in general, each glycosyltransf erase 
is competent to perform a single transglycosylation reaction, 
that in turn generates a single glycosidic linkage. However, 
genetic and biochemical studies indicate that the human Lewis 
blood group locus may determine expression of a single 
fucosyltransf erase capable of generating subterminal Fuca(l,3) 
and Fuca(l,4) linkages on several type I and type II acceptor 
substrates. In particular, the Lewis enzyme is thought to be 
the only human fucosyltransferase capable of using the 
acceptor 2'-fucosyllactose. Since plasmid pCDM7-ot(l,3/l,4)FT 
was isolated with an enrichment scheme involving an enzymatic 
assay based upon this acceptor substrate, it therefore seemed 
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likely that its cDNA insert encodes the Lewis enzyme. To 
confirm this, the inventor performed a series of analyses to 
determine the acceptor specificities of the recombinant 
enzyme . 

Extracts of COS-1 cells transfected with 
pCDM7-a(l,3/l,4)FT were tested for their ability to catalyze 
transglycosylations between GDP-[ u C]fucose and the type I 
acceptor lacto-N-biose I, or the type II acceptors lactose and 
N-acetyllactosamine. There are only two possible classes of 
monofucosylated products that can be formed from each of these 
acceptors by known human fucosytransf erases* These are 
H-active products containing a Fucor(l,2) linkage on the 
terminal Gal of these molecules, or Lewis x- or Lewis a-active 
products containing fucose linked in alpha anomeric 
configuration to the subterminal monosaccharide of these 
acceptors via either the monosaccharide's C4 hydroxy 1 (type I 
acceptor) or its C3 hydroxy 1 (type II acceptors) . The 
inventor therefore fractionated the reaction products with a 
descending paper chromatography method that could distinguish 
between the two classes of reaction products possible with 
each acceptor, and thus allow determination of enzyme 
specificity. 

The inventor found that lactose was utilized by the 
recombinant enzyme to form a radiolabeled compound with the 
chromatographic mobility characteristic of the Lewis x 
trisaccharide 3 -fucosyl lactose, and distinct from the other 
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possible product, the type II H tri saccharide. Likewise, 
these extracts also generated 3-fucosyl-N-acetyllactosamine 
when N- acetyl 1 act osamine was used as an acceptor. 
Radiolabelled fucose was quantitatively released from each 
product upon digestion with oe-fucosidase, indicating that the 
enzyme attaches fucose to these acceptor substrates in alpha 
anomeric configuration. These results are consistent with the 
flow cytometry observations indicating that pCDM7-cr (1,3/1,4) FT 
determines expression of the Gal/} (1,4) [Fucal,3]GlcNAc linkage 
representing the SSEA-1 determinant. 

Moreover, the radiolabelled product of the type I 
acceptor lacto-N-biose I chromatographed with a mobility 
distinct from the H active standard 2 ' -f ucosy 1 lacto-N-biose I 
and consistent with its identity as the Lewis a trisaccharide 
4-fucosyllacto-N-biose I. This product was also susceptible 
to digestion with oe-fucosidase. Identical results were 
obtained for all three disaccharide acceptors when 
affinity-purified protein A-fucosyltransf erase was used in 
these experiments. Taken together, these results indicate 
that the recombinant fucosyl transferase can construct both 
Fuca(l,3) and Fuca(l,4) glycosidic linkages on type II and 
type I disaccharide acceptors, respectively. 

In a complementary set of analyses, type I and type II 
blood group H trisaccharides were tested as acceptors for the 
enzyme encoded by the f ucosy ltransf erase cDNA. Radiolabelled 
type I and type II H molecules were prepared by fucosylating 
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their disaccharide precursors at the C2 hydroxyl of their 
terminal galactose residues, using cell extracts containing 
the blood group H a (1,2) -FT and GDP[ K C]fucose. These 
HPLC-purif ied radiolabeled type I and type II H acceptors 
were then each used in reactions containing unlabel led 
GDP-fucose and affinity-purified fucosyltransf erase activity 
generated by pPROTA-ot(l,3/l,4)FT c . HPLC analysis of these 
reactions identified new radiolabeled compounds 
withchromatographic mobilities predicted for the Levis b 
tetrasaccharide and the Levis y tetrasaccharide, generated 
with the type I or type II acceptors, respectively. Virtually 
identical results were obtained with extracts prepared from 
cbs-l cells transfected vith pCDM7~a(l,3/l,4)FT. Results of 
these experiments and similar ones, are summarized in 
Table 2. 

In a third series of experiments, the inventor 
demonstrated that this enzyme can operate on "type I" or "type 
II 1 ' acceptors vhose terminal galactose residues are 
substituted vith sialic acid in a (2, 3) linkage, to generate 
the sialyl Levis x and sialyl Levis a tetrasaccharide 
determinants. Flov cytometry analysis of COS-1 cells 
transfected vith pCDM7-a(l, 3/ 1,4) FT and stained vith a 
monoclonal anti-sialyl Levis x antibody indicates that this 
plasmid can determine surface expression of the sialyl Levis x 
antigen (Fig. 8), that is the product of oc(l,3)FT action on 
"type II" acceptors vhose terminal galactose residues are 
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substituted with sialic acid in a (2, 3) linkage. Likewise, 
COS-l cells transfected with pCDM7-a (1, 3/1, 4) FT and stained 
with a monoclonal anti-sialyl Lewis a antibody indicates that 
this plasmid can determine surface expression of the sialyl 
Lewis a antigen (Fig. 8), that is the product of a(l,4)FT 
action on type I acceptors whose terminal galactose residues 
are substituted with sialic acid in a (2, 3) linkage. These 
analyses indicate that the fucosyltransf erase encoded by 
pCDM7-ct(l, 3/1,4) FT is able to construct two distinct 
glycosidic linkages on the subterminal 61c or GlcNAc of type I 
and ty£e II acceptors, and that this does not depend upon the 
a(l, 2) fucosylation or a(2 , 3) sialylation status of the terminal 
galactose on these acceptors. These properties mirror those 
reported for the fucosyltransf erase activities determined by 
human Lewis blood group locus, and confirm that a single 
fucosyltransf erase can catalyze the formation of two distinct 
glycosidic linkages. 

The Fucosyltransf erase cDNA Identif y** ff tym an Genomic Sequences 
Syntenic to the H uman Lewis Blood Group Locus 

Genetic data indicate that the human Lewis blood group is 
determined by a locus on chromosome 19. The 
fucosyltransf erase cDNA was therefore used for southern blot 
analysis of a pair of human-mouse somatic cell hybrids that 
differ only by the presence or absence of human chromosome 19. 
The results indicate that at high stringency, the 
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fucosyltransf erase cDNA identifies cross-hybridizing sequences 
located on chromosome 19. Taken together with the enzymatic 
analyses, these data strongly suggest that this cloned cDNA 
represents the product of the human Lewis blood group locus. 

Experimental Procedures for Example III. "Isolation of a 

cloned human cDNA that encodes a GDP-Fuc:fl-D-Gal 

(1 . 4 / 1 , 3 > -D-GlcNAc ( /Glc \ -a (1 . 3 / 1 . 4 1 -f ucosvltransf erase . 

cDNA Library Construction. A cDNA library Was prepared from 
poly (A) -plus RNA isolated from human A431 cells, using 
standard procedures and the mammalian expression vector pCDM7. 
pCDM7 lacks polyoma sequences present in the vector pCDM8, but 
is otherwise virtually identical. The library contained 2.6 x 
10 6 independent recombinants. 

Cell Lines, Mouse 3T3 -human hybrid cell lines KLEJ-47 and 
KLEJ-47/P-1 were obtained from Dr. Howard Green (Harvard 
University, Boston) . Mouse 3T3 cells were from Dr. Vishva 
Dixit (University of Michigan, Ann Arbor) . The origins of all 
other cell lines, and conditions for cell growth are as 
previously described in the literature. 



Preparation of Panning Dishes. Panning dishes were prepared 
by first coating them with goat anti-mouse IgM, and then with 
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monoclonal anti-SSEA-1 antibody (ascites generously provided 



by D. Solter, diluted 1:1000). 



cDNA Library Screening. The A431 library was screened as 
described previously. Plasmid DNA was rescued from 
transfected COS-1 cells adherent to panning dishes and 
introduced into the bacterial host MC1061/P3 by 
transformation. Transformants were grown to saturation in 
liquid culture under antibiotic selection, aliguots were 
removed for frozen storage, and the remainder of the culture 
was used to prepare plasmid DNA. A portion of this plasmid 
DNA was used for subsequent enrichment by transf ection and 
immunose lection on anti-SSEA-1 panning dishes. 

FACS Analysis. Transfected COS-1 cells were stained with the 
mouse IgM anti-SSEA-1 (anti-Lewis x) monoclonal antibody 
(1:1000 dilution of ascites) mouse monoclonal IgM anti-H or 
anti-Lewis a antibodies (Chembiomed, Ltd. , Edmonton; 10 
/xg/ml) , a mouse monoclonal IgM anti-sialyl Lewis x antibody 
(CSLEX, P. Terasaki, lOug/ml) , or a mouse monoclonal IgG anti- 
sialyl Lewis a antibody (CSLEA, P. Terasaki, 1:1000 dilution 
of ascites) . Cells were then stained with f luorescein- 
conjugated goat anti-mouse IgM (Sigma; 40 ng/ml) and subjected 
to analysis by fluorescence activated cell sorting as 
described previously in the literature by the inventor. 
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Northern and Southern Blotting. A431 cell RNA was subjected 



to Northern blot analysis as previously described. The probe 
consisted of a 1.7 kb Xhol * Xbal fragment isolated from the 
5' end of the cDNA insert in plasmid pCDM7-a(l, 3/1,4) FT. This 
fragment does not contain the portion of this cDNA that 
exhibits sequence similarity to human Alu sequences. This 
probe was labelled by nick translation with af^PjdCTP to a 
specific activity of 6 x 10 8 cpm/Mg* 

Genomic DNA was prepared and subjected to Southern 
blotting as described previously. Blots were subjected to a 
final wash for 30 minutes at 65 °C in 0.1X SSC, 0.5% SDS. The 
probe used was identical to the one used for Northern blot 
analysis except that it was labelled with the random priming 
method. 

Sequencing. The cDNA insert in plasmid pCDM7-ot(l f 3/1, 4) FT was 
sequenced by the chain termination method using a double 
stranded plasmid DNA template and commercial reagents 
(Pharmacia) . Both strands were sequenced using 17-mer or 
19-mer oligonucleotide probes synthesized according to the 
sequence of the cDNA insert. The DNA sequence was assembled 
and analyzed using the Beckman Microgenie package, and the 
Sequence Analysis Software Package of the University of 
Wisconsin Genetics Computer Group. 
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In Vitro Transcription-Translation . Plasmid 
pCDM7-a (1,3/1, 4) FT DNA was linearized downstream from the 
cloned cDNA insert by digestion with Notl. Capped RNA 
transcripts were then generated from this linearized template 
using a T7 polymerase promoter based in vitro transcription 
kit (Pharmacia) . Transcripts initiate from the T7 promoter 
proximal to the cDNA cloning site in pCDM7. RNA transcripts 
produced in vitro were used to program a rabbit reticulocyte 
lysate in vitro translation system (Pr omega) , in the presence 
of 35 S-methionine (Amersham) , according to the manufacturer's 
instructions. Membrane-associated radiolabeled in vitro 
translation products, generated in the presence of canine 
pancreatic microsomal membranes (Promega) (cotranslation) or 
generated prior to the addition of microsomes 
(post-translational microsome addition) , were isolated from 
the bulk of the soluble reaction components by centrifugation 
through a sucrose cushion (0.5 M sucrose, 10 mM Tris 7.4, 150 
mM NaCl; 170,000 x g for 60 min) . For endoglycosidase H 
digestions, pellets containing microsome- associated 
radiolabelled products were first resuspended in 50 mM sodium 
citrate pH 5.5, were made 0.2% in SDS, and were heated to 
100 °C for 4 minutes* Aliguots of this material were then 
diluted with an equal volume of water and subjected to 
digestion with either 10 mU or 5 mU of endoglycosidase H for 
20 hrs at 37 °c, in the presence of 0.1% BSA, 0.5% Triton 
X-100, 0.5 mM PMSF, 40 nq/nl Bestatin, 10 Mg/*1 <* 2 
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macroglobulin, and 30 tiq/ml of E-64. Alternatively, the 



buffer containing 5 mM CaCl 2 , and were subjected to incubation 
with 150 /ml proteinase K, on ice for 1 hour, in the 



radiolabeled in vitro translation products were then 
denatured and reduced by heating to 100 °C for 4 minutes in 
62.5 mM Tris pH 6.8, 100 mM dithiothreitol, 2% SDS, 10% 
glycerol, and 0.02% bromphenol blue. Samples were then 
fractionated through SDS polyacrylamide gels, and the gels 
were subjected to autoradiography. 



Fiicosvltransf erase Assays. Cultured cells were washed in PBS, 
harvested by scraping with a rubber policeman, washed again in 
PBS, and pelleted by centrifugation. Cell extracts were 
prepared by resuspending cell pellets in 1% Triton X-100 such 
that the final protein concentration in the extracts was 
approximately 5 mg/ml (BCA method, Pierce Chemical Co,)* 

Fucosyltransferase assays were performed in 50 mM MOPS pH 
6.5, 25 mM MnCl 2 10 mM L-fucose, 5 mM ATP, 3 mM GDP-[ U C] fucose 
(specific activity of 600,000 cpm/nmol; 35,000 cpm per assay), 
2.5 mM acceptor (e.g. 2'-fucosyllactose, N-acetyllactosamine, 
lactose or lacto-N-biose I) , and up to 10 Ml of cell extract, 
in a final volume of 20 /U- determining a (1,3) FT 

specific activities achieved during the sib selection process, 
and in the analysis of the protein A-fucosyltransferase fusion 



pellets were resuspended in ice cold in vitro translation 



presence or absence of 1% Triton X-100. The various 
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protein experiments, the amount of added cell extract and 

* 

incubation times were adjusted to yield (linear) reaction 
* rates reflecting accurate specific activities. Reactions were 

incubated at 37 °C for 2 or 6 hours, and then terminated by the 
addition of 20 Ml of ethanol, followed by dilution with 500 Ml 
of H 2 0. The terminated, diluted reactions were then 
centrifuged at 15,000 x g for 5 min. Fifty Ml of each 
reaction supernatant was counted to determine total 
radioactivity, and 200 Ml of each was fractionated by Oowex-1 
chromatography. The neutral radiolabeled material eluting 
from the column was then counted directly as a measure of 
product formation. Enzyme specific activity is defined as 
pBiol of fucose transferred from GDP-fucose to acceptor per mg 
cell extract protein per hour* Neutral products were also 
further analyzed as described below by descending paper 
chromatography and by HPLC to confirm their identity. 
Parallel reactions were done in the absence of added acceptor 
to allow correction for transfer to endogenous acceptor 
molecules and for substrate and product hydrolysis. These 
control experiments indicated that less than 2.6% of the 
radioactivity in GDP- [ 14 C] fucose was found as a neutral product 
in the absence of added acceptor, and that virtually all of 
this material represented [ 14 C] fucose. 
L In instances where radiolabeled, H type I and H type II 

molecules were used as acceptors, nonradiolabelled GDP-fucose 
was included instead of GDP«r [ M c ] fucose , and reactions were 
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allowed to proceed for 16 hours. Residual unreacted neutral 
radiolabeled acceptor substrate, and neutral radiolabelled 
product were isolated by Dowex-1 chromatography and then 
analyzed by HFLC. 

Preparation of Ra diolabelled H Type I and H Type II Accentors. 
Cell extracts containing a human a(l,2)FT activity were used 
to synthesize radiolabelled type I H or type II H acceptor 
molecules. The cell extracts were prepared from mHl-12 cells, 
a mouse L cell transfectant containing a human DNA segment 
that encodes a human a(l r 2)FT. These extracts contain no 
detectable a (1,3) FT activity or or (1,4) FT activity. 
Lacto-N-biose I (20 mM) , or N-acetyllactosamine (20 mM) , were 
incubated with 100 M9 of mHl-12 extract protein in 40 /il of 25 
mM sodium cacodylate pH 6.2 containing 3 MM GDP-[ H C]fucose, 
for 16 hours at 37 °C. Reactions were terminated by the 
addition of 40 pi of ethanol followed by dilution with 200 Ml 
of water. Precipitated protein was removed by centrifugation 
at 12,000 x g for 5 minutes, and the neutral radiolabelled 
reaction products in the supernatant were isolated by Dowex-1 
chromatography. The type I H trisaccharide molecules 
( lacto-N-biose I reaction) or type II H trisaccharide 
molecules (N-acetyllactosamine reaction) comprising the 
majority of the respective neutral radiolabelled materials 
were then purified by HPLC as described below. 
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Product Analysis by HPLC and Descending Paper Chromatography. 
Neutral radiolabelled reaction products generated by 
affinity-purified protein A-fucosyltransf erase fusion protein, 
or by pCDM7 -a (1,3/1, 4 FT) -programmed COS-1 extracts, type I or 
type II disaccharide acceptors, and GDP-[ 14 C]fucose (see above, 
Pucosyltransf erase Assays) were fractionated by descending 
paper chromatography or by HPLC chromatography to determine 
their structures. Samples analyzed by HPLC were dissolved in 
70% acetonitrile and applied to a Dynamax 60A (primary amine 
column, Rainin Instruments, 4.14 mm x 25 cm) equilibrated in 
acetonitrile: water (70:30). Compounds were eluted with a 
linear gradient of acetonitrile: water (70:30 to 40:60), in 1 
hour, at a flow rate of l ml per minute. The eluant was 
monitored with a Beckman Instruments on-line radioisotope 



Samples analyzed by descending paper chromatography were 
dissolved in water and fractionated through Whatman No. 40 in 
phenol /isopropanol/ formic acid/water (85:5:10:100, lower 
layer). After chromatography (Figure 6; 40 hours in panel A or 
48 hours in panel B) , air-dried chromatograms were cut into 1 
cm strips and the radiolabelled compounds were eluted into 2 
ml of water. The radioactivity in each eluate was determined 
by scintillation counting after mixing with 10 ml of 
scintillation cocktail. HPLC-purif ied u C-labelled type I and 
type II H-active trisaccharide standards were prepared as 
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described above for the preparation of u C-labelled type I and 



qVL-Fucosidase Digestion. Neutral, HPLOpurif ied, 
radiolabelled fucosyltransf erase products were subjected to 
a-L-fucosidase digestion to confirm the alpha anomeric 
configuration of the attached fucose* 



( 1 , 3 ) [ ,4 C ] f ucosy 1-2 ' -f ucosy llactose , 

(It 3) [ H C]fucosy 1 lactose, and (1,4) [ 14 C]fucosyllacto-N-biose I 
were purified by HPLC, and aliquots of each (10,000 to 20,000 
cpms) were digested with 100 mU of a-L-fucosidase (E.C. 
3.2.1.51, Boehringer-Mannheim) in 70 Ml of 100 mM Na citrate 
pH 5.5, at 37 °C for 22 hrs. The reactions were desalted by 
Dowex column chromatography and subjected to HPLC analysis 
using conditions described above* The products of the 
digestion were identified by comparison to parallel 
separations of L-[ U C] fucose and [ 14 C]fucose-labelled acceptors. 
In each case, quantitative release of L-[ 14 C] fucose was 
achieved by a-L-fucosidase digestion. 



Smal-BamHl segment of the cDNA insert containing the putative 
fucosyltransferase catalytic domain was isolated from 
pCDM7-a(l,3/l,4)FT. This fragment was blunt-ended with the 



type II H-active acceptor trisaccharides . 



( 1 , 3 ) ( 14 C ] f ucosy 1-N-acetyllactosamine , 



PPROTA- 




A 1344-bp 
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Klenow fragment of DNA polymerase I, and the ends were ligated 



to kinased double stranded linkers (5' CGGAATTCCG 3')- The 



gel purified again. This fragment was inserted at the unique 
EcoRI site of FPROTA. One plasmid (pPROTA-a(l,3/l,4)FT c ) 
containing a single insert in the appropriate orientation was 
analyzed by DNA sequencing to confirm the predicted sequence 
across the junctions between the vector, linker, and insert. 

Plasmids pPROTA-ot(l,3/l,4)FT c , pCDM7-ot(l, 3/l,4)FT, or 
pPROTA, (50 fsq each) were separately introduced into COS-1 
cells (500,000 per 100 mm dish) by DEAE-dextranmediated 



transf ection. After a 72 -hour expression period, the media 
(10 ml) was harvested from each plate and subjected to low 
speed (300 X 6 for 8 min) and high speed (100,000 X G for 1 h) 
centrif ugations . The supematants were then adjusted to 0*05% 
Tween 20 and were either assayed directly, or were used in 
IgG-Sepharose binding studies. IgG-Sepharose or Sepharose 6B 
were preequi libra ted as described by the manufacturer 
(Pharmacia) , and then equilibrated in 10% fetal calf serum in 
Dulbecco's modified Eagle's medium (FCS/DMEM) . Aliquots of 
processed supernatants containing known amounts of enzyme 
activity prepared from transfected COS-1 cells were incubated 
batchwise with 100 pi of equilibrated IgG-Sepharose or 
Sepharose 6B, overnight at 4°C. Supernatants were saved for 
assay ("Flow thru" activity) . The matrices were then washed 
by centrifugation, 9 times with 1 ml of 50 mM Tris pH 7.5, 1 



ligated fragment was gel purified, digested with EcoRI, and 
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mg/ml bovine serum albumin, twice with 1 ml of 20 mM Tris pH 
7.5 , 5 mM CaCl 2) 0.05% Tween-20, and once with FCS/DMEM. The 
matrices were then resuspended in an .equal volume of FCS/DMEM. 
This suspension was used directly for assay of a (1,3) FT 
activity. 

Example IV. Cloning by cross-hybridization, an d expression. 

of a DNA sequence encoding GDP-FuC:fl-D- 
Gal f 1 . 4 ) -D-GlcNAc a ( 1 . 3 ) fucosvl transferase 
fFuc-TTVl (DNA SEP IP NO: 7. Protein SEP ID 

The inventor had previously used a mammalian gene 
transfer procedure to isolate a clone cDNA (SEQ ID N0:1) that 
encodes the human Lewis blood group fucosyltransf erase (SEQ ID 
NO: 2). The inventor was aware of biochemical and genetic data 
indicating that the human genome contains two or more other 
structural genes that encode fucosyltransferases competent to 
construct surface localized Lewis x determinants 
( Gal01-»4 [ Fuca ( l-»3 ) ] GlcNAc- ) . These other enzyme ( s ) were 
thought to be polypeptides distinct from the Lewis 
fucosyltransf erase because they exhibit different acceptor 
substrate specificities and differential sensitivities to 
divalent cation and N-ethylmaleimide inactivation . Moreover, 
their expression is determined by loci distinct from the Lewis 
blood group fucosyltransf erase locus, and they display tissue 
specific patterns that are different from expression patterns 
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determined by the Lewis locus. Because these enzymes exhibit 
properties that are very similar to the Lewis blood group 
* fucosyltransf erase, the inventor considered it possible that 

their corresponding genes might be sufficiently related at the 
primary sequence level to be able to isolate them by cross- 
hybridization approaches. He considered this even though he 
and others had previously shown that glycosyltransferase 
sequences that use the same substrates are not at all related 
in their primary nucleic acid or amino acid sequences, since 
he knew that the fucosyltransferases exhibited very similar 
substrate requirements, and in each case constructed one or 

i 
i 

more oligosaccharide products identical to those made by the 
Lewis fucosyltransf erase. 

In consideration of the possibility that these 
a (l f 3) fucosyltransferases might be encoded by a family of 
structurally-related genes, he sought to isolate other such 
members by cross-hybridization methods, using the cloned Lewis 
fucosyltransf erase cDNA. Low stringency Southern blot 
hybridization experiments indicate that the coding region of 
the Lewis a (1, 3) fucosyltransf erase cDNA detects strongly 
hybridizing human DNA restriction fragments, as well as 
several weakly hybridizing fragments. Weakly hybridizing 
fragments were always detected regardless of the restriction 
4 enzyme used, suggesting that these represented one or more DNA 

sequences distinct from the authentic Lewis gene presumably 
represented by the strongly-hybridizing fragments. To further 
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examine the molecular nature of these sequences, he screened a 
human lambda phage genomic DNA library at low stringency with 
the Lewis cDNA probe. A total of 18 phages were isolated from 
phages representing approximately five human genomic 
equivalents. Southern blot analysis of 16 of these phages 
allowed them to be placed into three groups, based upon their 
restriction patterns and hybridization signal intensity 
strengths. Six phages representing a class of intermediate 
hybridization intensity were identified. A 3.6 kb cross- 
hybridizing PstI restriction fragment was subcloned from a 
representative phage of this class. To determine the 
relationship of this fragment to cross-hybridizing fragments 
detected in human genomic DNA with the Lewis probe, a 400 bp 
Avall-PvuII segment of this fragment, that cross-hybridized 
with the Lewis coding sequence probe, was also used to probe 
Southern blots at low stringency. For each enzyme used to 
generate the Southern blots, the Avall-PvuII probe detected 
one strongly hybridizing fragment, and one or more weakly 
hybridizing fragments. Each strongly hybridizing fragment 
corresponded to one of the weakly hybridizing fragments 
generated by the same enzyme and detected by the Lewis probe. 
Likewise, the strongly hybridizing fragments detected with the 
Lewis probe correspond to fragments that exhibit weak 
hybridization to the Avall-PvuII probe. These results 
suggested that this probe, and the 3.6 kb fragment from which 
it was derived, represented the weakly cross-hybridizing DNA 
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sequences detected by the Levis probe on genomic DNA Southern 



blots . 



The homologous DNA restricti on fragment maintains a 
single long open reading frame that predicts a polypeptide 
with similarity to the Levis blood group 

a ( l . 3 / 1 . 4 ) fucosvltransf erase cDNA . DMA sequence analysis of 
the 3.6 kb PstI fragment (SEQ ID NO: 7) identified a single 
long open reading frame vithin its 3' portion corresponding to 
sequences that cross-hybridized to the Levis cDNA probe (Fig* 
4 and Fig. 5) . This reading frame begins with a methionine 
codon that is found vithin a sequence context consistent with 



Kozak's consensus rules for mammalian translation initiation. 
Moreover, hydropathy analysis of the protein sequence 
predicted by this reading frame indicates a single hydrophobic 
segment at its NH 2 - terminus , suggesting that the predicted 
polypeptide (SEQ ID NO: 8) vould maintain the type II 
transmembrane orientation typical of mammalian 
glycosyltranf erases . The distal portion of this reading frame 
shares a substantial amount of amino acid sequence identity 
vith the corresponding portion of the Levis fucosyltransferase 
(Fig. 5) . These sequences share the highest degree of 
similarity betveen their C00H- terminal portions, vithin the 
catalytic domain of the Levis fucosyltransferase. Sequence 
divergence occurs tovard the predicted NHj-end, vithin the 
"stem" and transmembrane regions of the latter enzyme. 
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The DNA restriction fragment de tects mRNA transcripts in 
HL-60 myeloid cells. To test the possibility that this 
segment represents a functional a(l,3)fucosyltransferase gene, 
a portion of it was used as a probe to identify transcripts in 
a cell line known to express such enzymes. The HL-60 human 
cell line was examined since these myeloid lineage cells are 
known to express one or more a (1, 3) fucosyltransf erases that 
are distinct from the Lewis a (1,3) fucosyltransf erase. 
Northern blot analysis of polyadenylated mRNA isolated from 
these cells, using the 400 bp Avall-PvuII segment 
corresponding to a portion of the open reading frame, 
identified four distinct transcripts. By contrast, no 
transcripts were detected when the same analysis was preformed 
using the Lewis cDNA. These results are consistent with the 
possibility that the fucosyltransf erase (s) expressed by HL-60 
cells are encoded by the open reading frame in the cloned PstI 
segment • 

The open reading frame in the homolog ous DNA restriciton 
fragment determin es expression of an qfl , 3) fucosyltransf erase, 
To determine if this segment encodes an 

a (1,3) fucosyltransf erase, the 3.6 kb PstI fragment was cloned 
into a mammalian expression vector and the resulting plasmid 
(pCDNAl-Fuc-TIV, "Experimental Procedures") was introduced 
into two types of mammalian host cells by transfection. 
Transfected cells were then analyzed for vector-dependent cell 
surface glycocon jugate expression and for fucosyltransferase 
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activity. COS-1 cells and CHO cells were used as hosts for 
these experiments since neither cell line normally expresses 
any detectable a (1,3)- and a(l, 4) fucosyltransf erase 
activities. Likewise, COS-1 and CHO cells do not normally 
express detectable amounts of cell surface 
Galj3l-»4 [Fucot( l->3 ) ]GlcNAc-( Lewis x, SSEA-1) moieties, or the 
a2-»3 sialylated derivative (NeuAca2-OGal/Jl-*4 [Fuca(l-* 
3)]GlcNac-, sialyl Lewis x) . These cells do, however maintain 
surface-display of the non-fucosylated neutral and a2-*3- 
sialylated type II oligosaccharides that can function as 
precursors to such molecules, via the action of the 
a (1,3) fucosyltransf erase encoded by a transfected Lewis cDNA 
expression vector. COS-1 cells also maintain surface display 
of the type I precursors to the Lewis a 
(Galj3l-*3[Fuca(l->4) ]GlcNAc-) and sialyl Lewis a 
(NeuNAca2-3Gal/Jl-0[Fuca(l-4) ]GlcNAc-) moieties. The vector 
pCDNAI was used since this plasmid efficiently transcribes 
exogenous, subcloned sequences in mammalian hosts by virtues 
of the cytomegalovirus immediate early promoter sequences in 
the vector. 

In initial biochemical analyses, extracts prepared from 
COS-1 cells transfected with plasmid pCDNAI-Fuc-TIV were 
tested for the presentee of vector-dependent 
fucosyltransf erase activity, using several low molecular 
weight acceptor substrates. In a standard fucosyltransf erase 
assay ("Experimental Procedures") , extracts prepared from 
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pCDNAl-Fuc-TIV transfected cells, but not from control 
transfectants , contained a fucosyltransf erase activity (296 
pmol/mg-h) that utilized the type II disaccharide acceptor N- 
acetyllactosamine to yield a radiolabeled product with a 
chromatographic mobility ("Experimental Procedures") 
characteristic of authentic Gal/Jl->3[Fuca(l-*4) ]GlcNAc 
(^wm " °- 85 >- However, under these assay 
conditions, two other neutral type II molecules 
(2 'fucosyl lactose, lactose) did not function as efficiently as 
N-acetyllactosamine as acceptor substrates for the 
fucosyltransf erase in these extracts (17 and 10 pmol/mg-h, 
respectively, for 2 '-fucosyl lactose and lactose) . Only a 
trace amount of transfer could be detected using the type I 
substrate lacto-N-biose I. 

Likewise, the inventor did not detect fucose transfer to 
the sialylated acceptor NeuAcoc(2-0) Gal/3 (l-*4)GlcNAc (less than 
1 pmol/mg-h) , even in extracts that exhibited a relatively 
large amount of activity toward N-acetyllactosamine (474 
pmol/mg-h). By contrast, under these same conditions, 
extracts containing the Lewis blood group fucosyltransf erase 
utilized both the sialylated acceptor (297 pmol/mg-h) and N- 
acetyllactosamine (526 pmol/mg-h) , to form, respectively, the 
sialyl Lewis x tetrasaccharide and the Lewis x trisaccharide 
(see "Experimental Procedures"). Thus, the restricted 
acceptor preference exhibited by this enzyme in vitro 
contrasts remarkably with that exhibited by the Lewis 
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a (1, 3/1, 4 ) fucosyltransf erase, which can efficiently utilize 



each of the five acceptors tested. These results are 
summarized in Table 2. 

a 

COS-1 cells transfected with pCDNAl-Fuc-TIV were also 
analyzed by flow cytometry to detect de novo, vector-dependent 
surface expression of these oligosaccharide products, to allow 
an assessment of the enzyme's in vivo acceptor substrate 
requirements. The transfected COS-1 cells exhbited positive 
staining with a monoclonal antibody directed aginst the Lewis 
x moiety Gal01-*4[Fuca(l-O) JGlcNAc-) (Fig. 8), whereas cells 
transfected with the pCDNAl vector without insert did not 



express this determinant. However, COS-1 cells transfected 
with pCDNAl-Fuc-TIV, or with its control plasmid, did not 
stain with antibodies specific for the sialyl Lewis x antigen 
(Fig. 8) . Likewise, the transfected cells did not exhibit 
detectable surface expression of Lewis a or sialyl Lewis a 
molecules (Fig. 8) . 

Polylactosaminoglycans with terminal a (2-^3) -linked sialic 
acid also exist that maintain a single internal a (1,3) -linked 
fucose on the N-acetylglucosamine residue of the pneultimate 
lactosamine repeat. This determinant (NeuAca2-»3Gal01-» 
4GlcNAc01-»3Gal/Jl->4[Fuca(l-O) ]GlcNAc-) can be detected on the 
surfaces of myeloid cells by the monoclonal antibody VIM-2, 
and may be constructed by the action of 
a (1, 3) fucosyltransferase(s) on type II polylactosamine 
acceptors whose terminal galactose residues are substituted 



* 
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with a (2 , 3) sialic acid moieties. Neither COS-1 cells 
transfected with the Lewis ot(l, 3/1,4) fucosyltransf erase, nor 
COS-1 cells tranfected with plasmid pCDNAl-Fuc-TIV display 
detectable amounts of this determinant. 

Virtually identical results were obtained with COS-l 
cells transfected with the plasmid pCDNAI-a(l,3)FTMlu 
( " Experimental Procedures"). This vector encompasses 
sequences corresponding to bp-1904 through the end of the open 
reading frame in Fig. 4. These results provide additional 
evidence for the hypothesis that the open reading frame 
displayed in Fig. 4 corresponds to the coding portion of this 
fucosyltransf erase gene. 

To further demonstrate that enzymatic activity is 
directly associated with this protein, the putative catalytic 
domain of the predicted polypeptide (amino acids 50 to 405 of 
SEQ ID NO: 8) was fused to a secreted form of the IgG binding 
domain of Staphylococcus aureus protein A in the mammalian 
expression vector pPROTA, to generate the vector pFROTA-a(Fuc- 
TIV) . Since this fusion protein would lack the putative 
transmembrane anchoring segment of the fucosyltransf erase, the 
inventor expected it would be synthesized as a secreted 
molecule that could be affinity-purified on an IgG-containing 
matrix and subsequently tested for a(l,3)FT activity. COS-1 
cells transfected with the control vectors pCDM7 or pPROTA 
generated no detectable cell-associated or released enzyme 
activity. However, conditioned media prepared from COS-1 
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cells transfected with pPROTA-a (Fuc-TIV) c contained significant 
quantities of a(l,3)FT activity when assayed with N- 
acetyllactosmine. Virtually 100% of the released activity 
generated by pPROTA-a (Fuc-TIV) c is specifically retained by the 
IgG-Sepharose matrix. These results indicate that the protein 
encoded by this cloned DNA segment encodes a 
fucosyltransf erase, and demonstrate that information 
sufficient to generate a(l,3)FT activity resides within the 
enzyme's COOH-terminal 356 amino acids. 

Biochemical analysis of extracts prepared from a CHO cell 
line transfected with pCDNAl -Fuc-TIV (CHO-FT3 cells) yielded 
results similar to those obtained with the transfected COS-1 
cells. In the standard fucosyltransf erase assay (Experimental 
Procedures) , extracts prepared from the control transfected 
cell line CHO-V did not contain a (1,3) fucosyltransf erase 
activity when tested with N-acetyllactosamine, 2'- 
fucosyl lactose, lactose, or lacto-N-biose I, or 
NeuAca (2-3) Gal(l— 4)GlcNAc. By contrast, extracts prepared 
from the CHO-FT3 line contained an a (1,3) fucosyltransf erase 
activity (59.1 pmol/mg-h) that utilized the type II 
disaccharide acceptor N-acetyllactosamine to yield a 
radiolabeled product characteristic of authentic 

GaljS ( 1^4 ) ( Fuca ( 1-3 ) ] GlcNAc (R 2 ' -f ucooyi-N-acetyllactOBamine ~ 

0.85) (see "Experimental Procedures"). Under these assay 
conditions, the CH0-FT3 extracts utilized the type II acceptor 
molecules 2 ' -f ucosy llactose and lactose with substantially 
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lower efficiency (5,8 pmol/mg-h and 2.0 pmol/mg-h. 



respectively) . Virtually no transfer could be detected when 
these extracts were tested with the type I substrate lacto-N- 
biose I (<1 pmol/mg-h) of with the sialyl Lewis x precursor 
NeuNAcot2-3GaljSl-4GlcNAc (<1 pmol/mg-h) . These results confirm 
those obtained with extracts prepared from the transfected 
COS-1 cells, and indicate that, to a first approximation, the 
COS-l and CHO genetic backgrounds do not strongly influence 
the enzyme's ability to utilize these five low molecular 
weight acceptor substrates* 

With one striking and important exception, flow cytometry 
analyses with the CHO-FT3 cells were virtually identical to 
those obtained with the transfected COS-1 cells* CHO-FT3 
cells exhibit uniform, bright staining with anti-Lewis x 
antibody , but not with antibody directed against the sialyl 
Lewis x molecule. Control transfected cells do not stain with 
either antibody. As expected, neither cell line stained with 
antibodies against the neutral and a2-0-sialylated Lewis a 
isomers, since CHO cells do not construct type I precursors. 
However, these cells differed in an important way from the 
transfected COS-1 cells, in that, like CHO cells transfected 
with the Lewis a(l, 3/1,4) fucosyl transferase cDNA (pCDM7- 
a(l,3/l,4)FT) , these cells expressed substantial amounts of 
the VIM- 2 determinant. 

Taken together with the results of the biochemical 
analyses performed with extracts from the transfected cells 
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and summarized in Table 2, the flow cytometry analyses 
presented in Fig. 8, the protein A gene fusion experiments, 
and the DNA sequence analysis indicate that plasmid pCDNAl- 
Fuc-TIV encodes an ot(l, 3) fucosyl transferase. Transfection 
results obtained with plasmid pCDNAI-a (1, 3 ) FTMiu also 
demonstrate that this enzyme is encoded by the open reading 
frame displayed in Fig. 4. The results further indicate that 
this enzyme can utilize type II precursors, but not type I 
precursors, and suggest that the enzyme cannot efficiently 
utilize a2-*3-sialylated type II glycoconjugates to form the 
sialyl Lewis x determinant. 

Experimen tal Procedures for Example IV. "Cloning bv 
cross-hvbridization. and expression, of a DNA sequence 
encoding GDP-Fuc : fl-D-Gal f 1 . 4 1 1-D-GlcNAc 
a ( 1 , 3 \ -Fucosvltransf erase : ( Fuc-TIV 1 " 

Cell culture . The source and growth conditions of COS-1 
cells, CHO transfectants, and A431 cells are as previously 
described ( Ernst et al . J, Biol- Chem. (1989) 265:3436-3447; 
Raian et al . J. Biol, Chem. (1989) 264:11158-11167). The 
human HL-60 cell line was obtained from Dr. Steve Kunkel 
(University of Michigan, Ann Arbor) . HL-60 cells were grown 
in 10% fetal calf serum and Dulbeccols Modified Eagle's 
Medium . 

Antibodies . The anti-Lex antibody anti-SSEA-1 Sorter et 
al, Proc. Nat. Acad. Sci. (USA! (1978) 75:5565-5569) (mouse 



WO 97/09421 1 

-204- 

monoclonal IgM as ascites) was used. Anti-H and anti-Lewis a 
antibodies (mouse monoclonal IgM, antigen affinity purified) 
were purchased from Chembiomed Ltd. (Edmonton, Alberta) . 
Anti-sialyl Levis x antibody CSLEX1 f Fukushima et al . Cancer 
Res. (1984) 11:5279-5285) (mouse monoclonal IgM, HFLC 
purified) and anti-sialyl Lewis a antibody CSLEA1 f chia et al . 
Cancer Res . (1985) 4£:435-437) (mouse monoclonal IgG3, 
ammonium sulfate precipitate) were used. Anti-VTM-2 was 
obtained from Dr. Bruce Macher (San Francisco State 
University) . A pooled mouse IgG antibody preparation (Mslg) 
was purchased from Coulter. Fluor escein-conjuga ted goat anti- 
mouse IgM or IgG antibodies were purchased from Sigma* 

HnTnarf g enomic library construction . High molecular 
weight human genomic DNA was prepared from peripheral blood 
leukocytes as described previously ( Ernst et al (1989)). 
Genomic DNA was subjected to partial digestion with the 
restriction endonuclease Sau 3A. The partially digested 
genomic DNA was size fractionated by ultracentrifugation 
through a sodium chloride gradient. Fractions enriched for 
DNA fragments between 8 Kb and 20 Kb were ligated to 3&oI 
digested lambda FIX (Strata gene) phage arms that had been 
partially filled in with dTTP and dCTP to make the ends 
compatible with the j§au3A fragments. The ligation mixture was 
packaged in vitro with commercial packaging extracts 
(Stratagene) , titered on £. coli host TAP90 ( Patter on et al. 
Nucl. Acids Res . (1987) 1§:6298). Approximately 1.0 X 10 6 
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recombinant lambda phage were screened by plaque 
hybridization. Plaque lifts were prepared using 
nitrocellulose filters (Schleicher and Schuell) and were 
prehybridized at 42 °C for 16 hours in 50% formamide, 5X SSC, 
10X Denhart's solution, and 0.1% SDS. Filters were hybridized 
for 72 hours at 35 °C in prehybridization solution containing 
10% dextran sulfate, and 100 micrograms per ml denatured 
salmon sperm DNA. The probe consisted of a 1.7 Kb Xho l- Xba l 
fragment isolated from the 5' end of a cDNA insert encoding 
the Lewis blood group a (1,3/1,4) fucosyltransf erase which was 
labeled with [a- 32 ?] dCTP. The filters were rinsed three times 
for 20 minutes each at room temperature in 2X SSC and then 
once for 40 minutes at 50°C and IX SSC, 0.5% SDS. Filters 
were then subjected to autoradiography. Eighteen independent 
hybridization-positive plaques were identified after 2 
additional cycles of plaque hybridization. Phage DNAs were 
prepared from liquid lysates and were subsequently 
characterized by restriction endonuclease digestions and 
Southern blot analyses. 

DNA sequence analysis . Phage DNA was digested with Pst I 
and a 3.8 Kb fragment homologous to the human 
a(l, 3/1, 4) fucosyltransf erase cDNA was gel purified and ligated 
into the Pst I site of pTZ!8R. A representative subclone 



hybridization-positive Avail -PstI fragment was isolated from 
insert in pFT-3 and subcloned into pTZ18R. The DNA sequence 



containing a single insert was designated pFT-3. A 970 bp 
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of the insert in this plasmid was determined by the dideoxy 
chain determination method using T7 DNA polymerase (Pharmacia 
LKB Biotechnology, Inc.) and oligonucleotides synthesized 
according to flanking plasmid sequences and subsequently 
according to the insert sequence. This sequence data was used 
to generate additional synthetic deoxynucleo tides which were 
then used to sequence portions of the insert in pFT-3. 
Sequence analysis was performed using the sequence analysis 
software package of the University of Wisconsin Genetics 
Computer Group. 

Southern blot analysis . High molecular weight human 
genomic DNA was prepared from whole peripheral blood. Genomic 
DNA (10 fig) was digested with restriction endonucleases , 
fractionated through a 0.8% agarose gel, and subjected to 
Southern blot analysis. To aid in the comparison of 
hybridization patterns obtained with different probes, 
duplicate blots were prepared from identical sets of 
restriction digests electrophoresed on a single gel. Southern 
blots were hybridized with the temperature being maintained at 
35° C. Southern blots were probed with the 1.7 Kb Xho l- Xba l 
fragment of plasmid pCDM7-a(l,3/l,4)FT which represents the 5' 
end of the human cDNA encoding the Lewis 

a(l,3/l,4)fucosyltransferase enzyme. Alternatively, Southern 
blots were probed with a 400 bp Ava ll-PvuII fragment isolated 
from the insert in pFT-3. Following hybridization, blots were 



i 



WO 97/0M21 




/US96/13816 



207- 



rinsed twice in 2X SSC 0.5% SDS at room temperature for 10 
minutes, washed, and then subjected to autoradiography. 

Northern blot analysis. Total RNA was prepared from 
cultured cells Poly A+ RNA was then isolated from total RNA by 
oligo dT cellulose column chromatography using commercially 
supplied columns (Clontech) and procedures supplied by the 
manufacturer. RNA samples were electrophoresed through 1.0% 

* 

agarose gels containing formaldehyde and were then transferred 
to a nylon tembrane (Hybond-N, Amersham) . Northern blots were 
prehybridized for 1 hour at 61°C in IX PE (16), 5X SSC, 1% 
sodium dodecyl sulfate, and 100 jig/ ml sheared salmon sperm 
DNA. Blots were then hybridized for at least 16 hours at 61°C 
in the same hybridization solution. The probe was a 
radiolabelled 400 bp Ava ll- Pvu ll fragment isolated from the 
insert in pFT-3. Following hybridization, blots were 
subjected to three, ten minute room temperature rinses in 2X 
SSC, and then washed for 30 minutes at 62 °C in 2X SSC, 0.2% 
SDS. 

Transf ection and expression of the insert in pFT-3 . The 
3.8 Kb Pst I insert in plasmid pFT-3 was excised and cloned 
into the Pst I site in the mammalian expression plasmid pCDNAl 
(Xnvitrogen) . One plasmid with a single insert in the sense 
orientation with respect to the plasmid 's CMV promoter 
enhancer sequences was designated pCDNAl-Fuc-TIV, and was used 
for subsequent analysis. 
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Construction and radiolabelina of stablv transfected CHO 
cell lines . CHO Ade-C cells were transfected with 
Scal-linearized pCDM7 -Fuc-TIV , co-precipitated in a 10-fold 
molar excess over EcoRI-linearized pSV2-Neo. A single, 
clonal, SSEA-l-positive cell line (CH0-FT3) was derived from 
this population. Cell extracts prepared from CHO-FT3 
contained substantial amounts of a(l f 3) fucosyltransferase 
activity when assayed with the acceptor N-acetyllactosamine. 

FACS analysis , COS-1 cells transfected with plasmid DNAs 
were harvested 48*72 hours after transfection, and stained 
with monoclonal antibodies diluted in staining media. 
Anti-Lewis a and anti-H antibodies (mouse IgM monoclonal; 
antigen-affinity purified; Chembiomed, Edmonton) were used at 
10 jig/ml. Anti-SSEA-l (mouse monoclonal IgM; ascites) was 
used at a dilution of 1:1000. Anti-sialyl Lewis x (mouse 

* 

monoclonal IgM; HPLC purified from ascites) was used at 10 
pg/ml. Anti-sialyl Lewis a (mouse monoclonal IgG3; ammonium 
sulfate precipitate of ascites) was used at a dilution of 
1:1000. Control mouse IgG3 antibody (Mslg, Coulter) was used 
at a concentration of 10 /xg/ml. Anti-VIM-2 antibody (mouse 
monoclonal IgM; ascites) was used at a dilution of 1:200. 
Cells were then stained with fluorescein isothiocyanate- 
conjugated goat anti-mouse IgM or IgG, as appropriate, and 
were then subjected to analysis on a FACScan (Becton- 
Dickinson) . 
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Fucosvl transferase assays . Cell extracts containing l% 
Triton X-100 were prepared from transfected COS-1 cells. 
Fucosyltransf erase assays were performed in a total volume of 
20 pi, and contained 50 mM sodium cacodylate, pH 6.2, 5 mM 
ATP, 10 xnM fucose, 20 mM MnCl 2 , 3 jiM GDP- u C-fucose, and 5 /il 
(30 fig protein) of cell extract. Acceptor substrates were 
added to a final concentration of 20 mM. Reactions were 
incubated at 37 °C for 1 hour and terminated by addition of 20 
/il ethanol, followed by addition of 600 /il of distilled water. 
An aliquot of each reaction (50 /*1) was subjected to 
scintillation counting to determine total radioactivity in the 
reaction. Another aliquot (200 fil) was applied to a column 
containing 400 /il of Dowex 1X2-400, formate form. The flow 
through fraction, and 2 /il of a subsequent water elution, were 
collected and pooled, and an aliquot was subjected to 
scintillation counting to quantitate incorporation of 
radioactive fucose into neutral product. Descending paper 
chromatography was used to confirm the structure of the 
product formed with the acceptor N-acetyllactos amine. The 
neutral product in the Dowex column eluate was concentrated by 
lyophilization, resuspended in a small volume of water, and 
fractionated through Whatman No. 40 in phenol/ isopropanol/ 
formic acid/water (85:5:10:100, lower layer). After 
chromatography (40 hours) , the air-dried chromatogram was cut 
into 1 cm strips and the strips eluted into 2 ml of water. 
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Radioactivity in each eluate was determined by scintillation 



counting after mixing with 10 ml of scintillation cocktail. 

An affinity-purified, protein A-Lewis fucosyltransf erase 
fusion protein was used to prepare an authentic, radiolabeled 
Gal0(l-*4) [ w C-Fuca(l-*3) ] GlcNAc standard for this analysis. 
This fusion protein was incubated with 20 mM 
N-acetyllactosamine, 150 /im GDP- [ 14 C ] f ucose (sp. act. « 3 f 800 
cpms/nmol) , in a standard fucosyltransf erase reaction mixture. 
The neutral, radiolabeled product was purified by amine 
adsorption HPLC on a Waters Carbohydrate Analysis column, 
using an isocratic gradient consisting of 70% acetonitrile in 
water, at a flow rate of 1 ml/min. The product (17 nmol) was 
subjected to analysis by U C NMR (Center for Complex 
Carbohydrate Research, Athens, GA) . The sample was exchanged 
repeatedly in D 2 0 and subjected to NMR analysis at 500 MHz* 
The proton NMR spectrum was recorded on a BruXer AM500 
instrument at 28 °C. Chemical shifts are relative to acetate 
(6, 1.908 ppm) . The structure of the expected trisaccharide 
Galj8(l-*4) [Fuca(l-+3) )GlcNAc, was verified by 500 MHz 



H-l signals for GlcNAC (a-anomer) at 5«5.102, for Gal at 

5«4.467 and 4.454 ppm (for the a- and /J-anomers, respectively, 

of the trisaccharide) and for Fuc at **5.102 ppm. The 

anomeric signal for the 0-anomer of GlcNAc was obscured by the r 

residual HOD peak (5*4.72 ppm). The methyl signals of the 

GlcNAc H-acetyl group and the C6 protons of Fuc were observed 



The spectrum, recorded in D 2 0 at 28 °C, showed 
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at $*2.032 and 1.178 ppm, respectively. These chemical shifts 
natch those published for the authentic trisaccharide 
GaljS ( 1-4 ) [ Fuca (1-3)3 GlcNAc • 



g-L-Fucosidase Digestion . The neutral, 
chromatographically-purif ied radiolabeled fucosyltransf erase 
product was subjected to a-L-fucosidase digestion to confirm 
the alpha anomeric configuration of the attached fucose. 
3-[ M C] fucosyl-N-acetyllactosamine was purified by descending 
paper chromatography as described above, and an aliquot (7000 
cpms) was digested with 40 mU of a-L-fucosidase (B.C. 
3-2.1.51, Boehringer-Mannheim) in 20 fil of 100 mM Na citrate 
pH 5.5, at 37 °C for 22 hrs. The reaction was desalted by 
Dowex column chromatography and subjected to HPLC analysis 
using conditions described above for preparation of the 
radiolabeled standard. The product of the digestion was 
identified by comparison to parallel separations of L- [ 14 C] 
fucose and the 3- [ U C] fucosyl-N-acetyllactosamine starting 
material* Quantitative release of L-[ H C] fucose was achieved 
by a-L-fucosidase digestion. 
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Example V. 



Isolation of a GDP-Fuc : fl-D-G al (1.4) -D-GlcNAc 



ttf 1.31 Fucosyltransferase (Fuc-TV. DNA SEP ID 
NO: 10. Protein SEP ID NO: 11) through 
cross-hybridization : 
Molecular cloning of a human genomic DNA segment that 
cross-hybridizes to the Lewis blood group a(l, 3/1,4) fucosyl- 
transf erase cDNA: Low stringency Southern blot hybridization 
experiments have indicated to the inventor that the coding 
region of the Lewis fucosyltransferase cDNA detects strongly 
hybridizing restriction fragments, as well as several weakly 
hybridizing fragments. 

The inventor expected that the strongly hybridizing 
fragments represented one or more genes similar to the Lewis 
fucosyltransferase cDNA. To further examine the molecular 
nature of these sequences, as noted above in Example IV, the 
inventor screened a human lambda phage genomic DNA library at 
low stringency with the Lewis cDNA probe. A total of 18 
phages were isolated from phages representing approximately 
five human genomic equivalents. Southern blot analysis of 16 
of these phages allowed them to be placed into three groups, 
based upon their restriction patterns and hybridization signal 
intensity strengths. Several phages representing a class with 
strong hybridization intensities were identified. 
Cross-hybridizing restriction fragments isolated from one of 
these phages was subcloned and sequenced. 
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The homologous DNA restriction fragment maintains a 
single long open reading frame that predicts a polypeptide 
with strong similarity to the Lewis blood group a (1,3/1, 4) 
fucosyltransf erase cDNA: DNA sequence analysis of the 
sub cloned fragments identified a single long open reading 
frame within its 3' portion, beginning at base pair 1 and 
ending at base pair 1125 (SEQ ID NO: 10) (see Figure 6). This 
reading frame begins with a methionine codon that is found 
within a sequence context consistent with Kozak's consensus 
rules for mammalian translation initiation. - A hydropathy 
analysis of the protein sequence predicted by this reading 
frame (SEQ ID NO: 11) would predict a single hydrophobic 
segment at its NHj-terminus , suggesting that the predicted 
polypeptide would maintain the type II transmembrane 
orientation typical of mammalian gly cosy Itransf erases. 
Virtually the entire length of this reading frame shares a 
strikingly high amount of amino acid and nucleic acid sequence 
identity with the corresponding portion of the Lewis 
fucosyltransf erase cDNA (Figure 6) . This sequence similarity 
diverges in just a few positions, most notably at base pair 
139 within the open reading frame. This 33 base pair 
insertion, relative to the Lewis cDNA, would create a peptide 
insertion of 11 amino acids, relative to the Lewis 
fucosyltransf erase. Because of the substantial sequence 
similarity between these two DNA sequences, and their derived 
protein sequences, the inventor expected this new 
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cross -hybridizing sequence to represent a single exonic 



expression of an a(l, 3) fucosyltransf erase: To determine if 
this segment encodes a functional fucosyltransf erase, a 1,94 
)cb Earl-Xbal fragment containing the entire open reading frame 
was cloned into a mammalian expression vector, and the 
resulting plasmid (pCDNAl-Fuc-TV) was introduced into 
mammalian host cells by transf ection. COS-l cells were used 
as hosts for these experiments since these cells normally 



express virtually undetectable a(l,3)- and 

a (1,4) fucosyltransf erase activities. Likewise, COS-l cells do 
not normally express detectable amounts of cell surface 
Gal01->4(Fuca(l->3) )GlcNAc- (Lewis x, SSEA-1) or 

* 

GaljSl->3[Fuca(l->4) ]GlcNAc- (Lewis a) moieties, whereas they 
do maintain surface-display of non-fucosylated type II and 
type I oligosaccharide precursors necessary for the 
construction of such molecules. The vector pCDNAI was used, 
since this plasmid efficiently transcribes exogenous, 
subcloned sequences in COS-l hosts by virtue of the 
cytomegalovirus immediate early promoter sequences in the 
vector, and is maintained in these cells as a multicopy 
episome. 

COS-l cells transf ected with pCDNAl-Fuc-TV were first 
analyzed by flow cytometry to detect de novo, vector-dependent 



sequence, representing a heretofore undefined gene, that 



encodes a 



fucosyltransf erase 



The homologous DNA restriction fragment determines 
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surface expression of these oligosaccharides. A substantial 



fraction of these transfected cells exhibited bright staining 



(Gal|5(l-4) [Fuca(l-3)]GlcNAc-) (Fig. 8) , whereas cells 
transfected with the pCDNAl vector without insert did not 
express this determinant. By contrast, COS-1 cells 
transfected with pCDNAl-Fuc-TV, or with its control plasmid, 
did not stain with antibodies specific for the type I-based 
Lewis a trisaccharide (Fig. 8). 

Taken together, these results are consistent with the 
results of DNA sequence analysis indicating that this segment 
encodes an cr(l, 3) fucosyltransf erase competent to utilize 
neutral type II oligosaccharide precursor , but that the enzyme 
cannot efficiently utilize type I glycoconjugates and thus 
does not exhibit strong a ( 1 , 4 ) fucosyltransf erase activity. 

There is evidence supporting the possibility that one or 
more human a (1,3) fucosyltransf erases exist that can utilize 
type II acceptors whose terminal galactose residues are 
substituted with a (2, 3) sialic acid moieties. Such enzymes can 
fucosylate these molecules to form the sialyl-Lewis x 
determinant (NeuAca(2->3)Galj8(l->4) [Fuca(l->3) ]GlcNAc-) • The 
Lewis fucosyltransf erase, for example, is competent to perform 

this reaction. 

It was therefore of interest to determine if the 
fucosyltransferase apparently encoded by plasmid pCDNAl-Fuc-TV 
would be capable of constructing sialyl-Lewis X determinants. 



with a monoclonal antibody directed against the Lewis x moiety 
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COS-1 cells maintain surface-expressed glycoconjugates 
terminating in (NeuAca(2, 3)Gal/3(l,4)GlcNAc-; these represent 
acceptor substrates for sialyl-Lewis construction determined 
via the action of enzymes encoded by transfected 
fucosyltransf erase expression vectors. COS-l cells were 
therefore transfected with plasmid pCDNAl-Fuc-TV, stained with 
a monoclonal anti-sialyl Lewis x antibody, and subjected to 
flow cytometry analysis. A significant amount of staining was 
detected, relative to control cell transfected with the vector 
alone, or relative to pCDNAl-a(l,3)-Fuc-TV-transfected cells 
stained with a negative control antibody (anti-H) . (Fig. 8). 
However, these cells did not stain with an antibody 



(NeuAca(2,3)Galj8(l,3) [Fuca(l, 4) ]GlcNAc-) (Fig. 8). By 
contrast, the inventor had previously observed that a 
substantial fraction of COS-l cells transfected with the Lewis 
fucosyltransf erase expression plasmid pCDM7-o(l,3/l,4) FT 
exhibit bright staining with the sialyl-Lewis x and 
sialyl-Lewis a antibodies, as predicted by biochemical 
analysis of the acceptor substrate specificity of this enzyme. 

The conclusion that plasmid pCDNAl-Fuc-TV encodes an 
a (1,3) fucosyltransf erase was confirmed by biochemical analysis 
of the acceptor substrate requirements of the enzyme in 
extracts of COS-l cells transfected with plasmid pCDNAl-Fuc- 
TV. As expected, the enzyme in these extracts utilized the 
type II disaccharide acceptor N-acetyllactosmine, to yield the 



specific for the type I -based, sialyl-Lewis a determinant 
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predicted product Gal/3 ( l->4 ) [Fuca( l->3) ]GlcNAc. The enzyme also 
efficiently utilized the trisaccharide 
NeuAcoc(2 , 3)Gal£(l, 4)GlcNac to form the sialyl Lewis x 
tetrasaccharide (Table 2) . Under these conditions, other type 
II molecules, including lactose, and the a ( 1 , 2 ) f ucosylated 
type II acceptor Fuca(l->2)GaljS(l->4)Glc, did function as 
acceptor substrates for the fucosyltransf erase in these 
extracts, although with substantially lower efficiencies. 
(Summarized in Table 2). 

Interestingly, the type I substrate lacto-N-biose I is 
also utilized by this enzyme, although again at low efficiency 
(Table 2 ) • This suggests that the enzyme can also function as 
ah a (1, 4) fucosyltransf erase, but at a very low efficiency, as 
also suggested by the absence of at (1,4) structures on flow 
cytometry analysis. The acceptor preferences exhibited by 
this enzyme contrast with that exhibited by the Lewis 
fucosyltransf erase, which is able to efficiently utilize each 
of the four acceptors tested. 

To further demonstrate that enzymatic activity is 
directly associated with this protein, the putative catalytic 
domain of the predicted polypeptide (amino acids 43 to 374 of 
SEQ ID NO: 11) was fused to a secreted form of the IgG binding 
domain of Staphylococcus aureus protein A in the mammalian 
expression vector pPROTA, to generate the vector pPROTA-Fuc- 
TV C . Since this fusion protein would lack the putative 
transmembrane anchoring segment of the fucosyltransf erase, the 
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inventor expected it would be synthesized as a secreted 
molecule that could be affinity-purified on an IgG-containing 
matrix and subsequently tested for a (1,3) fucosyltransf erase 
activity. COS-1 cells transfected with the control vector 
pCDM7 or pPROTA generated no detectable cell-associated or 
released enzyme activity. However, conditioned media prepared 
from COS-1 cells transfected with pPROTA-Fuc-TV c contained 
significant quantities of a (1,3) fucosyltransf erase activity 
when assayed with N-acetyllactosmine. Virtually 100% of the 
released activity generated by pPR0TA-Fuc-TV e is specifically 
retained by the igG-Sepharose matrix. These results indicate 
that the protein encoded by this cloned DNA segment encodes a 
fucosyltransf erase, and demonstrate that information 
sufficient to generate a (1,3) fucosyltransf erase activity 
resides within the enzyme's COOH-terminal 332 amino acids. 

Taken together with the results of the flow cytometry 
analyses and DNA sequence analysis, these experiments indicate 
that plasmid pCDNAl-Fuc-TV encodes a novel 
a ( 1 , 3 ) fucosyltransf erase . 



T ^p^yimental Procedures for Example V, "Isolation of a 
nnp-Fiir-fl-D-Gal n . a \ -D-GlcNAc a ( l . 3 \ Fucosyltransf erase 
(Fuc-TVl through cross-hvbridization: 

cell culture . The source and growth conditions of COS-1 
cells used are as previously described. See Bnipt et ^Xl, 
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Biol. Chem. (1989) 26£: 3436-3447 and Raian et al. J. Biol. 
Chem . (1989) 264:11158-11167. 

Antibodies > The anti-Lex antibody anti-SSEA-1 (mouse 
monoclonal IgM as ascites) was used. Solter et al . Proc . Nat . 
Acad. Sci. (USA) (1978), 75:5565-5569. Anti-H and anti-Lewis a 
antibodies (mouse monoclonal IgM, antigen affinity purified) 
were purchased from Chembiomed Ltd. (Edmonton, Alberta) . 
Anti-sialyl Lewis x antibody CSLEX1 (Fukushima et al . Cancer 
Res , (1984) 4455279-5285) (mouse monoclonal IgM, HPLC 
purified) and anti-sialyl Lewis a antibody CSLEA1 r chia et al, 
Cancer Res . (1985) 45 : 435-437) (mouse monoclonal IgG3, 
ammonium sulfate precipitate) were used. Fluorescein- 
cbnjugated goat anti-mouse IgM or IgG antibodies were 
purchased from Sigma. 

Human genomic library construction . High molecular weight 
human genomic DNA was prepared from peripheral blood 
leukocytes. Genomic DNA was subjected to partial digestion 
with the restriction endonuclease Sau3A. The partially 
digested genomic DNA was size fractionated by 
ultracentrifugation through a sodium chloride gradient. 
Fractions enriched for DNA fragments between 8 Kb and 20 Kb 
were ligated to Xhol digested lambda FIX (Stratagene) phage 
arms that had been partially filled in with dTTP and dCTP to 
make the ends compatible with the Sau3A fragments. 

The ligation mixture was packaged in vitro with 
commercial packaging extracts (Stratagene) , tittered on E. 
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coli host TAP90. Approximately 1.0 X 10 6 recombinant lambda 
phage were screened by plaque hybridization. Plague lifts 
were prepared using nitrocellulose filters (Schleicher and 
Schuell) and were prehybridized at 42 °C for 16 hours in 50% 
formamide, 5X SSC, 10X Denhart's solution, and 0.1% SDS. 
Filters were hybridized for 72 hours at 35 °C in 
prehybridization solution containing 10% dextran sulfate, and 
100 micrograms per ml denatured salmon sperm DNA. The probe 
consisted of a 1.7 Kb Xhol-Xbal fragment isolated from the 5' 
end of a cDNA insert encoding the Lewis blood group a (1,3/1,4) 
fucosyltransf erase , which was labeled with [o- 32 ?] dCTP. The 
filters were rinsed three times for 20 minutes each at room 
temperature in 2X SSC and then once for 40 minutes at 50 °C and 
IX SSC, 0.5% SDS. Filters were then subjected to 
autoradiography. Eighteen independent hybridization-positive 
plagues were identified after 2 additional cycles of plague 
hybridization. Phage DNAs were prepared from liquid lysates 
and were subsequently characterized by restriction 
endonuclease digestions and Southern blot analyses. 

DNA sequence analysis . Phage DNA was digested with 
various restriction enzymes, and fragments homologous to the 
human a (1, 3/1, 4) fucosyltransf erase cDNA were gel purified and 
ligated into the multicloning site of pTZ18R. Representative 
subclones were sequenced by the dideoxy chain determination 
method using T7 DNA polymerase (Pharmacia LKB Biotechnology, 
Inc.) and oligonucleotides synthesized according to flanking 
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plasmid sequences and subsequently according to the insert 
sequence. This sequence data was used to generate additional 
synthetic deoxynucleotides which were then used to sequence 
remaining portions of the inserts. Sequence analysis was 
performed using the sequence analysis software package of the 
University of Wisconsin Genetics Computer Group. 
Construction of Plasmid pCDNAI-Fuc-TV 
A 1.94 kb Earl-Xbal fragment was isolated from a 
representative phage taken from a strongly hybridizing class 
of phages, made blunt with the Klenow fragment of E. coli DNA 
polymerase I, and cloned into the EcoRV and Xbal sites in the 
mammalian expression plasmid pCDNAl (Invitrogen) . One plasmid 
with a single insert in the sense orientation with respect to 
the plasmid' s CMV promoter enhancer sequences was designated 
pCDNAl-Fuc-TV. 

FACS analysis . COS-l cells transfected with plasmid DNAs 
were harvested 48-72 hours after transf ection, and stained 
with monoclonal antibodies diluted in staining media. Anti- 
Lewis a and anti-H antibodies (mouse IgM monoclonal; 
antigen-affinity purified; Chembiomed, Edmonton) were used at 
10 Mg/ml. Anti-SSEA-1 (mouse monoclonal IgM; ascites) was 
used at a dilution of 1:1000. Anti-sialyl-Lewis x (mouse 
monoclonal IgM; HPLC purified from ascites) was used at 10 
/ig/ml. Anti-sialyl Lewis a (mouse monoclonal IgG3; ammonium 
sulfate precipitate of ascites) was used at a dilution of 
1:500. Cells were then stained with fluorescein 
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isothiocyanate-conjugated goat anti-mouse IgM or IgG, as 
appropriate, and were then subjected to analysis on a FACScan 

(Becton-Dickinson) • 

Fucosvltransf erase assays . Cell extracts containing 1% 
Triton X-100 were prepared from transfected COS-l cells. 
Fucosyltransf erase assays were performed in a total volume of 
20 $1, and contained 25 WL sodium cacodylate, pH6.2, 5 mM ATP, 
10 mM L~fucose, 10 mM MnCl 2 , 3 pM GDP- u C-fucose r and 5 Ml of 
cell extract. Acceptor substrates were added to a final 
concentration of 20 mM. Reactions were incubated at 37°C for 
1 hour and terminated by addition of 20 m! ethanol, followed 
by addition of 600 Ml of distilled water. An aliquot of each 
reaction (50 m!) was subjected to scintillation counting to 
determine total radioactivity in the reaction. Another 
aliquot (200 Ml) was applied to a column containing 400 m! of 
Dowex 1X2-400, formate form. The flow through fraction, and 2 
Ml of a subsequent water elution, were collected and pooled, 
and an aliquot was subjected to scintillation counting to 
quantitate incorporation of radioactive fucose into neutral 
product . 

Example VI. cloning and expre ssion of a DMA sequence 

encoding a GDP-Fuc;fl-D-Galf 1 T 41 -D-GlcNAc 
ttfl, 3) fucosvltransf erase" (Fuc-TV I ?DNA SEQ ID 
no; 13, protein SEP TP NO: 141 through cross 
hybridization. 
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Biochemical and genetic studies indicate that the human 
genome encodes two or more distinct GDP-L-fucose:#-D- 
Galactoside 3-a-L-Fucosyltransf erases (Potvin et al, J. Biol. 
Chem. , 265:1615-1622, 1990; Watkins, Adv . Hum . Genet . . 10:1- 
116, 1980) . the inventor has recently described a cloned cDNA 
that encodes one of these enzymes, that is thought to 
represent the product of the human Levis blood group locus 
(Kukowska-Latallo et al, Genes Devel . . 4:1288-1303, 1990) (DMA 
SEQ ID N0:1 and Protein SEQ ID NO: 2). In consideration of the 
possibility that these GDP-L-fucose:£-D-Gala'ctoside 3-a-L- 
Fucosyltransf erases might be encoded by a family of 
structurally-related genes, the inventor sough to isolate 
other such members by cross-hybridization methods, using the 
cloned Lewis fucosyl transferase cDNA. 



Molecular cloning of a human genomic DNA segment that 
crosshvbridizes to the Levis blood group 

a ( 1 , 3 /l . 41 fucosvltransf erase cDNA - As noted above in Examples 
IV and V, low stringency Southern blot hybridization 
experiments indicate that the coding region of the Levis 
fucosyltransf erase cDNA detects strongly hybridizing 
restriction fragments, as well as several weakly hybridizing 
fragments. To further examine the molecular nature of these 
sequences, the inventor screened a human lambda phage genomic 
DNA library at low stringency with the Lewis cDNA probe. A 
total of 18 phages were isolated from phages representing 
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approximately five human genomic equivalents . Southern blot 
analysis of 16 of these phages allowed them to be placed into 
three groups, based upon their restriction patterns and 
hybridization signal intensity strengths. Several phages 
representing a class with strong hybridization intensities 
were identified. Cross-hybridizing restriction fragments 
isolated from one of these phages was subcloned and sequenced. 

The homologous DNA restriction fragment m aintains a 
single long open reading frame that predicts a polypeptide 
with strong similarity to the Lewis bloo d group 
a f 1 . 3 / 1 . 4 ) fucosvl transferase cDNA - DMA sequence analysis of 
the cross-hybridizing subcloned fragment (SEQ ID NO: 13) 
identified a single long open reading frame , beginning at base 
pair 1 and ending at base pair 1080 (Figure 7) . This reading 
frame begins with a methionine codon that is found within a 
sequence context consistent with Kozak's consensus rules for 
mammalian translation initiation. A hydropathy analysis of 
the protein sequence predicted by this reading frame (SEQ ID 
NO: 14) predicts a single hydrophobic segment at its NH 2 - 
terminus, suggesting that it represents a 359 amino acid 
protein (SEQ ID NO: 14) that is predicted to maintain the type 
II transmembrane orientation typical of mammalian 
glycosyltransf erases, virtually the entire length of this 
reading frame shares a strikingly high amount of amino acid 
(not shown) and nucleic acid sequence identity with the 
corresponding portion of the Lewis fucosyltransferase cDNA 
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( Figure 7) . Because of the substantial sequence similarity 
between these two DNA sequences, and their derived protein 
sequences, we expected this new cross-hybridizing sequence to 
represent a single exonic sequence, representing a heretofore 
undefined gene, that encodes a fucosyltransf erase. 

The homologous DNA restrict ion fragment determines 
expression of an a(l. 3) fucosvl transferase - To determine if 
this segment encodes a functional fucosyltransf erase, a 1.2 kb 
fragment containing the entire open reading frame was 
generated by the polymerase chain reaction and was cloned into 
a mammalian expression vector, and the resulting plasmid 
(pCDNAl-Fuc-TVI) was introduced into mammalian host cells by 
transf ection. COS-1 cells were used as hosts for these 
experiments since these cells normally express virtually 
undetectable a (1,3)- and a ( 1 , 4 ) fucosyltransf erase activities 
(Kukowska-Latallo et al, Genes Devel . . 4:1288-1303, 1990). 
Likewise, COS-1 cells do not normally express detectable 
amounts of cell surface Gal/3l-4 [Fucoc (1-3) ]GlcNAc-( Lewis x, 
SSEA-1) or Gal01-3 [Fuca( 1-4) ]GlcNAc- (Lewis a) moieties, 
whereas they do maintain surface-display of non-fucosylated 
type II and type I oligosaccharide precursors necessary for 
the construction of such molecules (Kukowska-Latallo et al, 
Genes Devel. . 4:1288-1303, 1990). The vector pCDNAl was used 
since this plasmid efficiently transcribes exogenous, 
subcloned sequences in COS-1 hosts by virtue of the 
cytomegalovirus immediate early promoter sequences in the 
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vector, and is maintained in these cells as a multicopy 
episome (Kukowska-Latallo et al, Genes Devel. . 4:1288-1303, 
1990) . COS-1 cells transfected with pCDNAl-Fuc-TVI were first 
analyzed by flow cytometry to detect de novo, vector-dependent 
surface expression of these oligosaccharides. A substantial 
fraction of these transfected cells exhibited bright staining 
with a monoclonal antibody directed against the Lewis x moiety 
(Gal/3 ( l-»4 ) (Fuca(l-O) ]GlcNAc-) (Figure 8), whereas cells 
transfected with the pCDNAl vector without insert did not 
express this determinant. By contrast, COS-1 cells 
transfected with pCDNAl-Fuc-TVI , or with its control plasmid, 
did not stain with antibodies specific for the type I-based 
Lewis a trisaccharide (Figure 8) . Taken together, these 
results are consistent with the results of the DNA sequence 
analysis indicating that this segment encodes an 
a (1,3) fucosyltransf erase competent to utilize neutral type II 
oligosaccharide precursor, but that the enzyme cannot utilize 
type I glycoconjugates and thus does not exhibit 
a ( 1 , 4 ) fucosyltransf erase activity . 

There is evidence supporting the possibility that one or 
more human a (1,3) fucosyltransf erases exist that can utilize 
type II acceptors whose terminal galactose residues are 
substituted with a (2, 3) sialic acid moieties (Potvin et al, 
Biol, Chem. . 265:1615-1622, 1990; Holmes et al, J\_!ifiii 
Chem . . 261:3737-3743, 1986; Palcic et al, CaybohYflt Res,, 
190:1-11, 1989). Such enzymes can fucosylate these molecules 
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to form the sialyl-Lewis x determinant 
(NeuAca ( 2-3 ) Galtf ( 1-4 ) [ Fuccr ( 1-3 ) ] GlcNAc- ) . The Lewis 
fucosyltransf erase , for example, is competent to perform this 
reaction (Palcic et al, Carbohyd. Res, . 190:1-11, 1989; Lowe 
et al, Cell . 63:475-484, 1990). It was therefore of interest 
to determine if the fucosyltransf erase apparently encoded by 
plasmid pCDNAl-a ( 1 , 3 ) Fuc-TVI would be capable of constructing 
sialyl-Lewis x determinants. COS-1 cells maintain surface- 
expressed glycoconjugates terminating in 

(NeuAca ( 2, 3 )Gal)8( 1,4) GlcNAc- (Lowe et al, Cell . 63:475-484, 
1990; Fukuda et al, J. Biol. Chem. . 263:5314-5318, 1988); 
these represent acceptor substrates for sialyl-Lewis x 
construction determined via the action of enzymes encoded by 
transfected fucosyltransf erase expression vectors (Lowe et al, 
Cell . 63:475-484, 1990). COS-1 cells were therefore 
transfected with plasmid pCDNAl-Fuc-TVI , stained with a 
monoclonal anti-sialyl Lewis x antibody, and subjected to flow 
cytometry analysis. A significant amount of staining was 
detected, relative to control cell transfected with the vector 
alone, or relative to pCDNAl-Fuc-TVI -transfected cells stained 
with a negative control antibody (anti-H, Figure 8). However, 
these cells did not stain with an antibody specific for the 
type I-based, sialyl-Lewis a determinant 
(NeuAca(2,3)Gal|8(l,3) [Fuca(l,4) ]GlcNAc-) (Figure 8). By 
contrast, the inventor has previously demonstrated that a 
substantial fraction of COS-1 cells transfected with the Lewis 
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fucosyltransf erase expression plasmid pCDM7-0£(l,3/l,4)FT 
exhibit bright staining with the sialyl-Lewis x and sialyl- 
Lewis a antibodies (Lowe et al, Cell, 63:475-484, 1990), as , 
predicted by biochemical analysis of the acceptor substrate 
specificity of this enzyme (Palcic et al # Cfryfrofryd* 

190:1-11, 1989). 

The conclusion that plasmid pCDNAl-Fuc-TVI encodes an 
a( 1,3) fucosyltransf erase was confirmed by biochemical analysis 
of the acceptor substrate requirements of the enzyme in 
extracts of COS-1 cells transfected with plasmid pCDNAl-Fuc- 
TVI. As expected, the enzyme in these extracts utilized the 
type II disaccharide acceptor N-acetyllactosmine, to yield the 
predicted product Galj3(l->4) [Fuca(l-3) JGlcNAc. The enzyme also 
eff iceintly utilized the trisaccharide 
NeuAca(2,3)Gal0(l,4)GlcNAc to form the sialyl Lewis X 
tetrasaccharide (Table 2). Under these conditions, other type 
II molecules, including lactose, and the a ( 1 , 2 ) f ucosylated 
type II acceptor Fuca(l-2)Gal0(l-4)Glc, did not function as 
acceptor substrates for the fucosyltransf erase in these 
extracts, with any detectable efficiency- The type I 
substrate lacto-N-biose I was also not utilized by this 
enzyme. This suggests that the enzyme can function 
effectively only as an a (1,3) fucosyltransf erase, as also 
suggested by the flow cytometry analyses. The acceptor 
preferences exhibited by this enzyme contrast significantly 
with those exhibited by the Lewis fucosyltransf erase, which is . 
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able to efficiently utilize each of the four acceptors tested 
(Kukowska-Latallo et al, Genes Devel . , 4s 1288-1303, 1990; 
Mollicone et al, Eur. J. Biochem. . 191:169-176, 1990). Taken 
together with the results of the flow cytometry analyses 
presented in Figure 8, and DNA sequence analysis, these 
biochemical experiments, summarized in Table 2, indicate that 
plasmid pCDNAl-Fuc-TVI encodes a novel fucosyltransf erase with 
its own distinct acceptor specificity. 

Since the protein sequence of Fuc-TVI is so very similar 
to the sequences of FUC-TV and. FUC-TIII , it may be expected 
that a catalytically active , secreted protein A-Fuc-TVI fusion 
porotein may be generated by fusing residues 43 through 359 of 
Fuc-TVI (S2Q ID NO: 14) to the protein A segment, in a manner 
identicial to that used to generate pPROTA-a (1,3/1,4) FT C , aad 
pPROTA- Fuc -TV o 

I 

Experimental Procedures for Example VI "Cloning and 
expression of a DMA sequence encoding GDP-Fuc-fl-D° 
Galf 1.4)-D-GlcNAc ttfl.3) f ucosvltransf erase" (Fuc-TVI ? 
| DMA SEP ID rcPsl^. protein SE P ID MP;fr51 through cross 

hybridation . 

Cell culture. The source and growth conditions of CPS-1 cells 
are as previously described (Ernst et al, J. Biol. Chero. . 
265:3436-3447, 1989; Rajan et al, J. Biol. Chem, , 264slll5B~ 
11167, 1989). 
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Antibodies . The anti-Lex antibody anti-SSEA-1 (So Iter et al, 
Proc. Natl. Acad. Sci. USA . 75:5565-5569, 1978) (mouse 
monoclonal IgM as ascites) was provided by Dr. Davor Solter 
(Wistar Institute, Philadelphia). Anti-H and anti-Lewis a 
antibodies (mouse monoclonal IgM, antigen affinity purified) 
were purchased from Chembiomed Ltd. (Edmonton, Alberta) - 
Anti-sialyl Lewis x antibody CSLEX1 (Fukushima et al, Cancer 
Res. , 44:5279-5285, 1984) (mouse monoclonal IgM, HPLC 
purif ied) and anti-sialyl Lewis a antibody CSLEA1 (Galton et 
al, Ninth int- Convoc. Immune . Amherst, NY, pp. 117-125, 
Karger, Basel; Chia et al, cancer Res . . 45:435-437, 1985) 
(mouse monoclonal IgG3, ammonium sulfate precipitate) were 
provided by Dr. P. Terasaki (UCLA, Los Angeles) . A pooled 
mouse IgG antibody preparation (Mslg) was purchased from 
Coulter. Fluorescein-conjugated goat anti-mouse IgM or IgG 
antibodies were purchased from Sigma. 



Human genomic library construction. High molecular weight 
human genomic DNA was prepared from peripheral blood 
leukocytes as described previously (Ernst et al, J. Biol. 
Chem. . 265:3436-3447, 1989) . Genomic DNA was subjected to 
partial digestion with the restriction endonuclease Sau3A. 
The partially digested genomic DNA was size fractionated by 
ultracentrifugation through a sodium chloride gradient. 
Fractions enriched for DNA fragments between 8 Kb and 20 Kb 
were ligated to Xhol digested lambda FIX (Stratagene) phage 
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arms that had been partially filled in with dTTP and dCTP to 
make the ends compatible with the Saul A fragments. The 
ligation mixture was packaged in vitro with commercial 
packaging extracts (Stratagene) , titered on E. coll host TAP90 
(Patterson et al, Nucl. Acids. Res, . 15:6298, 1987). 
Approximately 1.0 X 10* recombinant lambda phage were screened 
by plaque hybridization. Plaque lifts were prepared using 
nitrocellulose filters (Schleicher and Schuell) and were 
prehybridized at 42°C for 16 hours in 50% formamide, 5X SSC, 
10X Denhart's solution, and 0.1% SDS. Filters were hybridized 
for 72 hours at 35°C in prehybridization solution containing 
10% dextran sulfate, and 100 micrograms per ml denatured 
salmon sperm DNA. The probe consisted of a 1.7 Kb Xhol-Xbal 
fragment isolated from the 5' end of a cDNA insert encoding 
the Lewis blood group a (1,3/1, 4) fucosyltransf erase (Kukowska- 
Latallo et al, Genes Devel. . 4:1288-1303, 1990), which was 
labeled (Feinberg et al, Anal. Biochem. . 132:6-13, 1983) with 
[cr- 32 P] dCTP. The filters were rinsed three times for 20 
minutes each at room temperature in 2X SSC and then once for 
40 minutes at 50 °C and IX SSC, 0.5% SDS. Filters were then 
subjected to autoradiography. Eighteen independent 
hybridization-positive plaques were identified after 2 
additional cycles of plaque hybridization. Phage DNAs were 
prepared from liquid lysates (Maniatis et al, Molecular 
Cloning: A Laborato ry Manual. Cold Spring Harbor Lab. , Cold 
Spring Harbor, NY, 1982) and were subsequently characterized 
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by restriction endonuclease digestions and Southern blot 
analyses * 

DNA sequence analysis. Phage DNA was digested with various 
restriction enzymes, and fragments homologous to the human 
a(l,3/l,4)fucosyltransferase cDNA were gel purified and 
ligated into the multicloning site of pTZ18. Representative 
subclones were sequenced by the dideoacy chain determination 
method (Sanger et al, Proc. Nat l- Acad. Sci. USA, 74:5463- 
5467, 1977) using T7 DNA polymerase (Pharmacia LKB 
Biotechnology, Inc.) and oligonucleotides synthesized 
according to flanking plasmid sequences and subsequently 
according to the insert sequence. This sequence data was used 
to generate additional synthetic deoxynucleotides which were 
then used to sequence remaining portions of the inserts. 
Sequence analysis was performed using the sequence analysis 
software package of the University of Wisconsin Genetics 
Computer Group (Devereux et al, Nucl. Ac ids. Res.. 12:387-395, 
1984) . 

Transfection and expressio n of the insert in pCDNAl-Fuc-TVI . 
A 1.2 kb fragment was generated by the PCR, using DNA isolated 
from a representative phage taken from a strongly hybridizing 
class of phages, and cloned into the Hindlll site in the 
mammalian expression plasmid pCDNAl (Invitrogen) . One plasmid 
with a single insert in the sense orientation with respect to 
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the plasmid's CMV promoter enhancer sequences was designated 



pCDNAl-Fuc-TVI • 



FACS analysis, COS-1 cells trans feet ed with plasmid DNAs were 
harvested (Rajan et al, J. Biol. Chem. . 264:11158-11167, 1989) 
48-72 hours after transf ection, and stained with monoclonal 
antibodies diluted in staining media, as previously described 
(Kukowska-Latallo et al, Genes Devel . , 4:1288-1303, 1990; 
Ernst et al, J. Biol. Chem, . 265:3436-3447, 1989). Anti-Lewis 
a and anti-H antibodies (mouse IgH monoclonal; antigen- 
affinity purified; Chembiomed, Edmonton) were used at 10 
jig/ml. Anti-SSEA-1 (mouse monoclonal IgM; ascites) was used 
at a dilution of 1:1000. Anti-sialyl-Lewis x (mouse 
monoclonal IgH; HPLC purified from ascites) was used at 10 
Mg/ml. Ant i -sialyl Lewis a (mouse monoclonal IgG3; ammonium 
sulfate precipitate of ascites) was used at a dilution of 
1:1000. Control mouse Ig63 antibody (Mslg, Coulter) was used 
at a concentration of 10 /ig/ml. Anti-VIM-2 antibody (mouse 
monoclonal IgM; ascites) was used at a dilution of 1:200. 
Cells were then stained with fluorescein isothiocyanate- 
conjugated goat anti-mouse IgM or igG, as appropriate, and 
were then subjected to analysis on a FACScan (Becton- 
Dickinson) r as described previously (Kukowska-Latallo et al, 
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Fucosvltransf erase assays. Cell extracts containing 1% Triton 
X-100 were prepared from transfected COS-1 cells, using 
procedures described previously (Kukowska-Latallo et al, Genes 
Devel . . 4 2 1288-1303, 1990). Fucosyltransf erase assays were 
performed in a total volume of 20 jul, and contained 50 m£2 
sodium cacodylate, pH 6.2, 5 mM ATP, 10 saM fucose, 20 aB2 E3nCl 2 , 
3(M GDP- u C~fucose, and 5 til (30 Mg protein) of cell extract o 
Acceptor substrates were added to a final concentration of 20 

Reactions were incubated at 37 °C for 1 hour and 
terminated by addition of 20 ^1 ethanol, followed by addition 
of 600 /Al of distilled water. An aliquot of each reaction (50 
#1) was subjected to scintillation counting to determine total 
radioactivity in the reaction., Another aliquot (200 jaI) was 
applied to a column containing 400 #1 of Dowex 1X2-400, 
formate form (Raj an et al, J, Biol. Chem. . 264s 11158-11167, 
1989) o The flow through fraction, and 2 pi of a subsequent 
water elution, were collected and pooled, and an aliquot was 
subjected to scintillation counting to quantitate 
incorporation of radioactive fucose into neutral product. 
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Table 2 presents the relative product formation rates 
obtained with low molecular weight acceptor substrates using 
cell extracts containing recombinant human 
a(l, 3) fucosyltransf erases expressed in transfected COS-1 
cells, as described in the preceding sections. 
Fucosyltransf erase assays were performed as described in 
detail in Lowe et al , J. Biol, Chem. . (1991), 266:17467-17477; 
Weston et al . J. Biol. Chem. (1992), 167:4152-4160; and 
Kukowska-Latallo et al . Genes. Devel.. (1990), 4:1288-1303. 
For each enzyme, the same extract was used, with saturating 
amounts of each acceptor oligosaccharide (20 mM, except for 
2 ' -fucosy lactose which was used at 5 mM) , and GDP- [ U C ] f ucose 
was present at 3 pM. Reaction times and enzyme amounts were 
adjusted to ensure a linear rate of product formation (less 
than 15% of the GDF-fucose substrate consumed) . Products were 
separated by column chromatography and quant itated by liquid 
scintillation counting, and their structures were confirmed by 
high performance liquid chromatography, as described in Lgj*e 
et al , J- Biol. Chem. . (1991), 266:17467-17477; Westoyi et al, 
J. Biol. Chem. (1992), £fiZ:4152-4160; and Kukowslea-Latallo et 
aj,, Genes. Devel. . (1990), 1:1288-1303. 

Example VII. Chimeric Enzvmes. 

The abbreviations used are Lewis a (Le a ) , 
GalBl-3[Fucal-4]GlctfAc; Lewis x (Le x ) , GalBl->4[Fucotl-0]GlcJTAc; 
sialyl Lewis x (sLe x ) , NeuWAca2^3GalBl-*4[Fucal-3]GlcNAc; sialyl 
Lewis a (sLe a ) , NeuWAca2-^3GalBl-»3[Fucal-*4]GlcWAc; H, 




4u 
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Fucal->2Galfll->4GlcNAc a(l,3)fucosyltransferase; oc(l,3)Fuc-T, 
GDP-fucose:£-D-N-acetylglucosaminide 3-a-L-fucosyltransf erase; 
PCR, polymerase chain reaction; kb, kilobase; bp, base pair. 
Nomenclature - Previous literature (Weston, B.W., et al 
(1992) J. Biol, Chem., vol- 267, 4152-4160; Weston, B. W. , et 
al (1992) J. Biol. Chem., vol. 267, 24575-24584; Natsuka, S., 
and Lowe, J.B. (1994) Curr. Opin. Struct. Biol., vol. 4, 683- 
691) has summarized the properties of, and terminology for, 
Fuc-TIII (the human Lewis blood group 

ct(l,3/l,4)fucosyltransferase; Love, J.B., et al (1990) cell, 
vol. 63, 475-484; Kukowska-Latallo , J.F., et al (1990) Genes 
Devel., vol. 4, 1288-1303), Fuc-TV (Weston, B.W. , et al (1992) 
J. Biol. Chem., vol. 267, 4152-4160; Koszdin, K.L. , and Bowen, 
B.R. (1992) Biochem. Biophys. Res. Commun., vol. 187, 152- 
157), and Fuc-TVI (the "plasma-type" a(l, 3) fucosyltransf erase; 
Weston, B. W. , et al (1992) J. Biol. Chem., vol. 267, 24575- 
24584; Mollicone, R. , et al (1994) J. Biol. Chem., vol. 269, 
12662-12671) . 

Site-directed mutagenesis - Site directed mutagenesis 
procedures were used to install novel restriction sites in the 
wild type Fuc-TIII and Fuc-TVI coding sequences to facilitate 
excision and replacement of DNA segments corresponding to the 
various Fuc-TIII and Fuc-TVI. DNA fragments corresponding to 
the coding regions of Fuc-TIII and Fuc-TVI were first 
generated by the polymerase chain reaction (Saiki, R.K. , et al 
(1985) Science, vol. 230, 1370-1374). Fifty nanograms of 
plasmid template (pCDM7-a(l,3/l,4)FT, (Kukowska-Latallo, J.F., 
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et al (1990) Genes Devel., vol. 4, 1288-1303), or pcDNAI-Fuc- 
TVI, (Weston, B. W. f et al (1992) J. Biol. Chem. , vol. 267, 
24575-24584) were used in reactions with the Gene Amp Kit 
(Perkin, Elmer, Cetus) . Primers were used that flank the 
coding region of each gene (Fuc-TIII sense primer is 
V GCGCGAATTCATGGATCCCCTGGGTGCAGCCAAGCCACAAT, Fuc-TIII anti- 



^tQ It? »° & 

A primer is GCGCTCTAGAGGCAGATGAGGTTCCCGGCAGCCCAGGCAC; Fuc-TVI: 
^sense primer is GCGCGAATTCTCCTCTCTCCCCACTTCCCAGAGACT , Fuc-TVI 

anti-sense primer is 
- CGCGTCTAGAGGTGAAGCricAGGCAAACGAGTCCTTAGGT) Primers were 

designed to yield PCR products with an EcoBl site at the 5' 
end and an XJbal site at the 3' end. Twenty cycles of 
amplification were performed, consisting of 1.5 min at 94 °C, 
followed by annealing and extension for 3.5 min at 72 °C. The 
PCR products were digested with EcoRI and Xbal and were cloned 
between these sites in the vector pcDNAI (Invitrogen) , using 
standard molecular biology procedures (Maniatis, T., et al 
(1989) Molecular Cloning: A Laboratory Manual (Cold Spring 
Harbor Laboratory Press, NY)). The DNA sequence of the insert 
in each vector was determined using the dideoxy chain 
termination method (Sanger, F., et al (1977) Proc. Natl. Acad. 
Sci. USA, vol. 74, 5463-5467) and T7 DNA polymerase 
(Sequenase, US. Biochemical Corp.). A representative error- 
free clone was chosen for each gene (termed pcDNAI-Fuc-TIII 
and pcDNAI-Fuc-TVI) and was used for expression and 
mutagenesis procedures. 
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The coding regions of Puc-TIII and Fuc-TVI were mutated 
to create unique inter-subdomain restriction endonuclease 
sites. Restriction sites were chosen to correspond to 
positions within the conserved amino acid sequence between 
6Ubdomains, to maintain the wild type protein sequence, and to 
be unique within the plasmid vector. For Fuc-TIII, this 
involved installation of five new inter-subdomain restriction 
sites, and elimination of an MscI site. Creation of six new 
inter-subdomain restriction sites was required for Fuc-VI. 
Mutagenesis was completed using the p-Alter Mutagenesis Kit 
(Pr omega) , after subcloning the inserts in pcONAI -Fuc-TIII and 
pcDNAI - Fuc -TVI between the £coRI and Xbal sites of the plasmid 
p-Alter (Promega) . Simultaneous multiple oligonucleotide site 
directed mutagenesis was performed on single strand templates, 
using procedures recommended by the manufacturer. Sequences 
of the mutagenic oligonucleotides are shown in Figure 9b. 
Candidate mutant clones were screened for the desired 
restriction sites by restriction enzyme digestion. Mutated 
clones containing each of the desired mutations were sequenced 
(Sanger, F. , et al (1977) Proc. Natl. Acad. Sci. USA, vol. 74, 
5463-5467) in their entirety to identify ones that contained 
no undesired additional mutations. Single, appropriately 
mutated but fully correct clones for p-Alter-Fuc-TIII and 
p-Alter-Fuc-TVT , were digested with EcoRI and XJbal, and the 
released mutated inserts were recloned between the EcoKl and 
Xbal sites of pcDNAI. 
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Construction of recombinant Fuc-T Chimeras via cassette 
mutagenesis - All recombinant a(l,3)Fuc-T chimeras except 
those involving subdomain 3 exchange (Fuc-TC7 and Fuc-TC13) 
were constructed by exchanging various restriction fragments, 
flanked by unique restrictions sites, corresponding to the 
subdomains that differ between Fuc-TIII and Puc-TVI. In 
chimeras constructed via subdomain 3 exchange (Fuc-TC7 and 
Fuc-TC13, Fig. 11), Fuc-TIII and Fuc-TVI sense and anti-sense 
oligonucleotide primers encoding subdomain 3 were annealed, 
phosphorylated (Maniatis, T., et al (1989) Molecular Cloning: 
A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
NY) ) , and the resulting 39 bp "synthetic" double stranded DNA 
segments were cloned between the Smal and fist 1071 sites in 
pcDNAI -Fuc-TIII or pcDNAI-Fuc-TVT . The insert in each chimera 
was sequenced in its entirety. 

Finally, chimera Fuc-TC21 was constructed by replacing 
subdomains 4 and 5 of Fuc-TIII with a region corresponding to 
subdomains 4 and 5 of Fuc-TV. The PCR, and oligonucleotide 
primers containing a 5' Bst 11071 site and a 3' Bgl II site 
(sense primer: CCGACTCCAGTGTATACCCACA6GCAGAC ; anti-sense 
primer: GTCGGAGTCAGATCTGTAGGACAT) , were used to generate this 
segment. PCR amplification conditions were chosen to minimize 
Taq polymerase-induced errors (annealing at 94 °C for 3 min; 20 
cycles consisting of 1 min at 94 °C, 45 sec at 65°C, and 1 min 
at 72 °C; a final 7 min extension at 72 °C). The PCR product 
was digested with Bst 11071 and Bgl II, and the resulting 
fragment was used to replace the corresponding Bst 11071 and 
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Bgl II fragment (subdomains 4 and 5) of the Fuc-TIII coding 
region. An error free clone identified by sequence analysis 
was termed Fuc-TC21 f and was used for transfection analyses. 

Transfection of C05-7 cells - COS -7 cells were cultured 
and transfected (DEAE-dextran method) as previously described 
(Kukowska-Latallo, J.F., et al (1990) Genes Devel., vol. 4, 
1288-1303; Natsuka, S., et al (1994) J . Biol. Chem. , vol. 269, 

16789-16794) • Cell were transfected with 20 /ig of negative 
control vector plasmid DNA (pcDNAI) , or with the plasmid DNAs 
pcDNAI-Fuc-TIII, pcDNAI-Fuc-TVI, or each of the various 
pcDNAI-Fuc-TCs . Cells were also co-trans feet ed with 1 of 
the plasmid pCDM8-CAT (Smith, P.L., and Lowe, J.B. (1994) J . 
Biol. Chem., vol. 269, 15162-15171) to allow for normalization 
of flow cytometry and Western blot data to transfection 
efficiency. 

Antibodies - Anti-Lewis x (anti-SSEA-1, IgM, as ascites; 
So Iter, D., and Knowles, B.B. (1978) Proc. Natl. Acad. Sci. 
£75 A, vol. 75, 5565-5569) was purchased from the Developmental 
Studies Hybridoma Cell Bank (Iowa City, Iowa)). Anti-sialyl 
Lewis x (CSLEX1, IgM; Fukushima, K. , et al (1984) Cancer Res., 
vol. 44, 5279-5285) and anti-sialyl-Lewis a (CSLEA1, IgG; 
Chia, D. , et al (1985) Cancer Res . , vol. 45, 435-437) 
monoclonal antibodies were purchased from the UCLA Tissue 
Typing Laboratory (Los Angeles, CA) . Anti-Lewis a ascites and 
anti-H ascites were produced from hybridoma cell lines 151-6- 
A7-9 (Blaszczyk, M., et al (1984) Arch. Biochem. Biophys., 
vol. 233, 161-168) and BE2 (Young, W.W. , et al (1981) J. Biol. 
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Chem., vol. 256, 10967-10972) obtained from the American Type 
Culture Collection (ATCC, Rockville, MD) . 

Flow cytometry analysis - Transfected COS-7 cells were 
harvested 72 h after transf ection, and were stained with 
antibodies diluted in staining media (Ernst, L.K., et al 
(1989) J. Biol. Chem., vol. 265, 3436-3447), using procedures 
previously described (Lowe, J.B., et al (1990) Cell, vol. 63, 
475-484; Kukowska-Latallo, J.F., et al (1990) Genes Devel., 
vol. 4, 1288-1303; Macher, B.A. , et al (1991) Glycobiology , 
vol. 1, 577-584). Anti-Lewis a and anti-H ascites were used 
at a dilution of 1:500 and 1:100 respectively. Anti-SSEA-1 
ascites was used at a dilution of 1:1000. Anti-sialyl Lewis a 
was used at a concentration of 10.8 jug/ml. Anti-sialyl Lewis x 
was used at a dilution of 10 pg/ml. Cells were subsequently 
washed, and then stained with either a fluorescein 
isothiocyahate-conjugated goat anti-mouse IgM (Vector) used at 
a dilution of 1:200, or with a fluorescein isothiocyanate- 
conjugated goat anti-mouse IgG (Sigma) , used at a dilution of 
40 jig/ml. Stained cells were then analyzed on a Becton- 
Dickinson FACScan (Lowe, J.B., et al (1990) Cell, vol. 63, 
475-484; Kukowska-Latallo, J.F. , et al (1990) Genes Devel., 
vol. 4, 1288-1303; Macher, B.A. , et al (1991) Glycobiology, 
vol. 1, 577-584) . Thresholds for antigen-positivity were set 
at a fluorescence intensity level that excludes 99% of COS-7 
cells that had been transfected with the negative control 
vector and stained with the test antibody. These thresholds 
were set individually for each test antibody, on the day of 
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each experiment, before the experimental transf ectants were 
analyzed. 

Fucosvltransf erase assays - Extracts for cr(l,3)Fuc-T 
assays were prepared by harvesting COS-7 cells 72 hours after 
transf ect ion, and solubilizing the cell pellets in 1% Triton 
X-100 (Kukowska-Latallo, J. P., et al (1990) Genes Devel., vol. 
4, 1288-1303; Lowe, J.B. , et al (1991) J. Biol. Chem., vol. 
266 r 17467-17477; Weston, B.W., et al (1992) J. Biol. Chem., 
vol. 267, 4152*4160; Weston, B. W. , et al (1992) J. Biol. 
Chem. vol. 267, 24575-24584). Cell extract protein contents 
were determined with a micro BCA protein assay reagent 
(Pierce) . Fucosyl transf erase assays were performed in a final 
volume of 20 Ml, and contained 25 mM sodium cacodylate (pH 
6.2), 5 mM ATP, 10 mM L-fucose, 20 mM MnCl 2 and 3 mM 
GDP- [ 14 C] fucose. Cell extract protein concentration was varied 
to approximate linear reaction rate conditions (net 
[ U C] -fucose incorporation rate of a less than twenty percent 
of total input GDP- [ 14 C] fucose radioactivity). Concentrations 
of low molecular weight acceptor substrates (Galfil-+4GlcMAc, 
N-acetyllactosamine, LacNAc; or GalBl-OGlctfAc, lacto-tf-biose 
I, LtfB-I) were varied between 0.25 mM and 167 mM, for apparent 
Km determinations. Reactions were incubated at 37 °C for one 
hour and were terminated by the addition of 20 Ml ethanol, 
followed by dilution with 560 ml water. The total 
radioactivity in each reaction was determined by subjecting an 
aliquot to scintillation counting. To assess the amount of 
radioactive fucose incorporated into product, an additional 
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aliquot was applied to a Dowex 1x2-400 column in the formate 
form (Sasaki, K. , et al (1994) J. Biol. Chem., vol. 269, 
14730-14737; Natsuka, S., et al (1994) J. Biol. Chem., vol. 
269, 16789-16794; Macher, B.A., et al (1991) Glycobiology , 
vol. 1, 577-584; Natsuka, S., and Love, J.B. (1994) Curr. 
Opin. Struct. Biol., vol. 4, 683-691). The flow through 
fraction plus an additional 2 ml of a water column "wash" was 
collected, pooled and subjected to scintillation counting. 
Control assays containing no added acceptor substrate were 
also completed in parallel. Radioactivity in the flow through 
fraction, obtained in the absence of added acceptor, was no 
more than 1 % of the total added radiolabel. 

In some instances, a fucosyltransf erase was found to 
maintain a very low inherent specific activity towards one or 
both low molecular weight acceptor substrates, such that the 
rate of transfer was found to be directly proportional to 
acceptor substrate concentrations between 0.25 mH and at least 
167 mH. In these instances, the data in Table 3 indicate that 
the apparent Michael is -Menten constant exceeds 50 mM. As a 
measure of the relative efficiencies with which the enzymes 
utilize N-acetyllactosamine and lacto-N-biose I, the specific 
activity of each enzyme was determined, for each of these two 
acceptor substrates (20 mM concentration) . The ratio of the 
specific activity was calculated and is displayed in Table 3. 

Chloramphenicol acetvltransf erase assays - Transf ected 
COS-7 cells were harvested 72 h after transf ect ion, and 
extracts were prepared from them by sonicating the cells in 
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0.25M Tris-HCl (pH 8.0). Chloramphenicol acetyltransf erase 
assays were performed as previously described (Smith, P.L., 
and Lowe, J.B. (1994) J. Biol. Chem. , vol. 269, 15162-15171; 
Ausubel, J., and Frederick, M. (1987) Molecular Biology- 
Laboratory Manuals, Vols. 1 and 2, John Wiley and Sons, New 
York) . Total cell extract protein in each reaction was 
adjusted to produce less than 15% incorporation of 
[ 14 C] -chloramphenicol during a 1 h incubation at 37°C. 

Western Blot analysis - Cell extracts were prepared from 
transfected COS -7 cells 72 h after transf ection. Extracts 
contained 50 mH Tris-HCl (pH 6.8), 10% SDS, and 10% glycerol. 
Extracts were boiled for 3 min immediately after preparation, 
arid were stored frozen until use. Protein content was 
determined using the BCA reagent procedure. Extracts were 
prepared for SDS-PAGE by adding dithiothreitol to a final 
concentration of 0.1 M, and bromophenol blue to a final 
concentration of 0*05%. Samples were then boiled and 
fractionated by electrophoresis through a 10% SDS- 
polyacrylamide gel. After electrophoresis, the proteins were 
electrotransf erred to a PVDF membrane (Biorad) . The membrane 
was rinsed and then blocked for 12-14 h at 4°C in phosphate 
buffered saline, pH 7.4, containing 10% bovine serum albumin 
and 0.2% Tween 20. The blot was washed at room temperature in 
phosphate buffered saline, pH 7.4, 0.2% Tween 20 and was 
probed with a 1:1000 dilution of a polyclonal rabbit antiserum 
prepared against a fragment of the human Fuc-TIII catalytic 
domain expressed in E. coli. Alternatively, an identically 
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prepared blot was probed with a 1:1000 dilution of pre- immune 
rabbit serum. The blot was then washed, and then probed with a 
1:5000 dilution of a horse radish peroxidase-conjugated anti- * 
rabbit immunglobulin (Sigma) . The blot was then rinsed, 
exposed to ECL reagent (Amersham, UK) , and subjected to 
autoradiography . 

RESULTS 

Analysis of COOH-terminal subdomain exchanges - type I 
substrate utilization is determined by sequences in the NHj- 
terminal "hypervariable" segment - The amino acid sequences of 
human Fuc-TIII and Fuc-TVI differ from each other at only 7 
corresponding positions, COOH-terminal to residue 152/151 in 
Fuc-TIII /Fuc-TVI (Fig 9a) . By contrast, their amino acid 
sequences differ substantially in a "hypervariable" region 
bounded by their transmembrane segments (residue 37) and the 
beginning of their "sequence-constant" COOH-terminal domains 
(residues 152 and 151) . It therefore seemed most likely that 
peptide sequence (s) within their respective "hypervariable" 
segments determine whether or not each enzyme can (Fuc-TIII; 
Lowe, J.B., et al (1990) Cell, vol. 63, 475-484; Kukowska- 
Latallo, J.F., et al (1990) Genes Devel., vol. 4, 1288-1303) 
or cannot (Fuc-TVI; Weston, B. W. , et al (1992) J. Biol. 
Chem., vol. 267, 24575-24584) (Fig 9a) efficiently utilize 
type I acceptor substrates* 

This hypothesis was tested by first characterizing the * 
substrate utilization patterns of recombinant Fuc-T chimeras 
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in which the nearly identical COOH-termini of Fuc-TIII and 
Fuc-TVI were exchanged with each other (Figs. 9a and 10). 
Substrate utilization was determined by expressing recombinant 
chimeras in COS-7 cells, which maintain acceptor substrate 
precursors for surface-localized Le x , sLe x , Le a and sLe* 
epitopes (Lowe, J.B., et al (1990) Cell, vol. 63, 475-484; 
Kukowska-Latallo, J.F., et al (1990) Genes Devel., vol. 4, 
1288-1303) , and characterizing the chimera -determined cell 
surface glycosylation patterns of the transfected cells using 
flow cytometry. 

Four chimeras informative for the COOH-termini of these 
enzymes were evaluated. In two of these chimeras (Fuc-TCl and 
Fuc-TC3) , the exchanged segments were between amino acid 
residue 301 of Fuc-TIII or 300 of Fuc-TVI (at positions 
corresponding to a BstXI site in both genes; Fig. 9a and Fig. 
10) and the COOH-terminus of each enzyme. This represents a 
60 amino acid long peptide segment in Fuc-TIII, and a 59 
residue long peptide in Fuc-TVI. These segments differ from 
each other at three amino acid positions and by the presence 
(Fuc-TIII) or absence (Fuc-TVI) of a valine residue at 
position 353 (in Fuc-TIII) (Fig. 9a) . In the other two 
chimeras (Fuc-TC2 and Fuc-TC4; Fig. 10), the entire COOH- 
terminal regions were exchanged. These transposed segments 
encompass all seven amino acid sequence differences distal to 
the Bgl II site corresponding to amino acid 160 in Fuc-TIII and 
159 in Fuc-TVI (Figs. 9a and b and 10) . COS-7 cells 
transfected with chimera Fuc-TC3 and Fuc-TC4 (which maintain 
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the NH 2 -terminal segment of Fuc-TVI , and portions of the COOH- 
terminus of Fuc-TIII) express the Le* and sLe x antigens, but 
not the type I-based Le a or sLe 8 epitopes (Fig- 10) . By 
contrast, COS-7 cells transfected with chimera Fuc-TCl and 
chimera Fuc-TC2 (which maintain the NH 2 -terminal segment of 
Fuc-TIII, and portions of the COOH-terminus of Fuc-TVI) 
express the Le x , Le a and sLe a epitopes; however, neither 
chimera determines expression of the sLe x antigen (Fig. 10). 
The reason that these latter two chimeras do not yield sLe x 
expression is not known. Nonetheless, these results clearly 
demonstrate that the NH 2 -terminal "hypervariable" segments of 
Fuc-TIII and Fuc-TVI determine their respective abilities to 
utilize or disregard the type I acceptor substrates expressed 

by COS- 7 cells. 

This conclusion is reinforced by in vitro determinations 
of the relative efficiencies with which the 

fucosyltransf erases utilize a pair of representative, neutral 
low molecular weight type I and type II acceptor substrates, 
tf-acetyllactosamine and lacto-tf-biose I (Table 3). Wild type 
Fuc-TIII exhibits similar Michaelis-Menten constants for these 
type II and type I substrates (8.1 and 12.7 mM, respectively), 
although the enzyme exhibits a substantially higher relative 
specific activity with the type I substrate. By contrast, 
Fuc-TVI maintains a relatively low apparent 1^ for the type II 
acceptor (4.9 mM) , a relatively high apparent Michealis-Menten 
constant for the type I acceptor (greater than 50 mM) , and 
exhibits a substantially higher relative specific activity 
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towards the type II substrate (Table 3). The similarity 
between FUC-TC4 and Fuc-TVT observed by flow cytometry 
analysis (Fig. 10) is also evident from the enzymatic 
analyses; Fuc-TC4 exhibits apparent Michealis-Menten constants 
for the two acceptors, and a specific activity ratio, that do 
not differ substantially from those obtained with wild type 
Fuc-TVI (Table 3). FUC-TC2, the chimeric counterpart to 
Fuc-TC4 exhibits an inherently low specific activity towards 
both substrates, as reflected by the high apparent Km's for 
the two substrates (Table 3). Nonetheless, this enzyme, like 
wild type Fuc-TIII, utilizes the type I acceptor substrate 
much more efficiently than it does the type II acceptor (Table 
3). This result is also consistent with the observation that 
Fuc-TC2 directs type I product expression in COS-7 cells with 
an efficiency that equals or exceeds its ability to determine 
type II product expression (Fig. 10) . 

Analysis of single subdomain exc hanges points to 
subdomains 4 and 5 in type I subst rate utilization - To 
further localize peptide sequence (s) in the NH 2 -terminal 
"hypervariable" segments that are important to type I and type 
II acceptor utilization, we arbitrarily divided these segments 
into five distinct subdomains. The subdomains represent short 
peptide segments whose sequences are different in Fuc-TIII and 
Fuc-TVI. Subdomains are separated from each other by short 
peptide sequences that are identical in the two enzymes. We 
used site-directed mutagenesis procedures ("Experimental 
Procedures ) to install restriction sites in the DNA sequences 



WO 97/09421 




CT/US96/13816 



encoding Fuc-TIII and Fuc-TVI at positions corresponding to 
the inter-subdomain conserved sequence regions. These sites 
were located at the same relative positions in these genes, < 
and were designed to be unique, and to conserve the wild type 
inter-subdomain amino acid sequences of the two enzymes (Figs. 
9a and b). These sites allow the use of simple restriction 
fragment interchange procedures to create oc(l,3)Fuc-T chimeras 
in which one (or more) subdomain(s) from one enzyme are 
interchanged with the corresponding subdomain(s) in the other 
enzyme. 

We used this approach to construct a series of seventeen 
additional recombinant ot(l,3)Fuc-T chimeras derived from Fuc- 
TIII and Fuc-TVI. Each chimera was evaluated for its ability 
to determine expression of cell surface Le a , sLe 8 , Le x and sLe* 
epitopes in transfected COS-7 cells, using the flow cytometry 
procedures described above for chimeras Fuc-TCl, Fuc-TC2, Fuc- 
TC3 , and FUC-TC4. 

A series of single subdomain transpositions were used to 
identify either a Fuc-TVI subdomain that inactivates Fuc-TIII 
type I substrate utilization, or a Fuc-TIII subdomain that 
confers type I acceptor utilization when transposed into Fuc- 
TVI (Figs, lla and lib). Each of the five Fuc-TIII chimeras 
containing a single Fuc-TVI subdomain expresses each of the 
four cell surface antigens determined by wild type Fuc-TIII 
(Fig. lla) . Nonetheless, two of these chimeras (Fuc-TC8 and 
Fuc-TC9) , representing substitutions in subdomain 4 or 5, * 
respectively, yield a cell surface antigen profile that 
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differs substantially from the Fuc-TIII profile. These two 

■ 

chimeras yield populations of transf ectants that exhibit a 

* relative reduction in the number of cells expressing the type 

I-based Le a and sLe 8 antigens, and a relative increase in the 
fraction of cells expressing the type I I -based epitopes Le x and 
sLe*. Specifically, the ratio of the fraction of Le fl -positive 
cells (29.8%) to the fraction of Le x -positive cells (37.2%) for 
wild type Fuc-TIII in this experiment is 0.80. By contrast, 
for Fuc-TC8, this ratio is 0.19 (9.1% Le e positive versus 47.9% 
Le x positive), and the ratio is 0.29 for Fuc-TC9 (10.2% Le B 
positive versus 35.4% Le x positive), similar ratios are 
observed using results obtained with antibodies against the 
sialylated forms of these antigens (Fig. 11a) • These two 
chimeras thus yield a glycosylation phenotype that is roughly 
intermediate between the phenotypes observed for Fuc-TIII and 
Fuc-TVI. In contrast to the results shown in Figure 11a, 
substitution of single Fuc-TIII subdomains at corresponding 
positions in Fuc-TVI yields a set* of chimeras which each 
determine a glycosylation phenotype virtually identical to 
that determined by wild type Fuc-TVI (Fig. lib) . Taken 
together, these results indicate that sequences within 
subdomain four and five of Fuc-TIII, by themselves, can 
strongly, if not completely, influence the type I acceptor 

: substrate utilization efficiency of this enzyme. 

Analysis of multiple subdomain exchanges - type I 

* substrate utilization is determined bv composites of sequences 

in subdomains 3. 4. or 5 - To determine if peptide sequence 



% 
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within two or more adjacent domains can more fully direct type 
I acceptor substrate discrimination, we constructed and tested 
a series of chimeras assembled via multiple subdomain 
exchanges. Chimera Fuc-TC15 f containing subdomains 1, 2 and 3 
from Fuc-TVI transposed into the wild type Fuc-TIII sequence, 
generates levels of cell surface Le x , sLe x , Le e and sLe a 
determinants similar to that generated by wild type Fuc-TIII 
(Fig. 12a)- Likewise, the complementary chimera Fuc-TC17 
(Fig. 12b) utilizes only type II acceptor substrates to 
generate Le x and sLe x , in a manner virtually identical to wild 
type Fuc-TVI. These results indicate that domains 1, 2, and 
3> together contribute little, if anything, to type I acceptor 
substrate utilization. 

By contrast, results obtained with an additional set of 
chimeras indicates that subdomains 4 and 5, when transposed 
together (with or without subdomain 3) , strongly influence 
type I substrate usage* Replacement of subdomains 3, 4 and 5 
in Fuc-TIII with the corresponding subdomains from Fuc-TVI 
yields a fucosyltransf erase chimera (Fuc-TC16; Fig. 12a) that 
utilizes type II substrates exclusively, to generate cell 
surface Le x or sLe x determinants. This phenotype is virtually 
identical to that generated by wild type Fuc-TVI (Fig. 12a) . 

In a similar manner, replacement of subdomains 3, 4 and 
5 in Fuc-TVI with the corresponding subdomains from Fuc-TIII 
creates a fucosyltransf erase chimera (Fuc-TC18; Fig. 12b) with 
a phenotype qualitatively similar to that of wild type Fuc- 
TIII. Specifically, COS-7 cells expressing this chimera 
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express all four epitopes analyzed (Fig. 12b) , although at 
reduced absolute levels (Le x , 8-1% positive; sLe x , 3.0% 
positive; Le a , 4.8% positive; and sLe a , 4.0% positive) relative 
to levels generated by wild type Fuc-TIII. The diminished 
amount of cell surface antigen expression yielded by Fuc-TC18 
is apparently not a consequence of decreased expression of the 
chimeric enzyme since western blot analysis of Fuc-TC18 
transfectants identifies amounts of immunoreactive Fuc-TC18 in 
these cells that are approximately equivalent to the amounts 
of wild type Fuc-TVI, and nearly equivalent to wild type Fuc- 
TIII accumulations. Enzyme activity assays suggest that the 
decreased cell surface fucosylated antigen expression we 
observe with this chimeric fucosyltransf erase is due to the 
enzyme's inherently low specific activity, and not because it 
is mislocalized or otherwise not able to effectively interact 
with its intracellular target oligosaccharides. a(l,3)Fuc-T 
activity assays indicate that the specific activity of this 
chimera is substantially lower than either Fuc-TIII or Fuc- 
TVI, when assayed with the low molecular weight type I and 
type II acceptor substrates lacto-ff-biose I and N- 
acetyllact os amine (apparent JC ro 's exceeding 50 mM for these 
substrates; Table 3). Nonetheless, this chimeric enzyme 
exhibits the same in vitro "preference" for the type I 
acceptor substrate observed for wild type Fuc-TIII (Table 3) . 
Thus, even though the inherent efficiency of this chimeric 
fucosyltransf erase is apparently substantially less than that 
of the wild type enzyme, the presence in this chimera of 
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subdomains 3, 4, and 5 from Fuc-TIII is associated with 
retention of a phenotype that is qualitatively similar to wild 
type Fuc-TIII r in vitro (Table 3) and in vivo (Fig. 12b). 

Subdomain 3 in Fuc-TIII and Fuc-TVI varies only by the 
presence (Fuc-TVI) or absence (Fuc-TIII) of a consensus 
sequence for asparagine- linked glycosylation (Fig. 9b) . This 
sequence, by itself, is without apparent effect on acceptor 
substrate utilization as judged from transfection analyses, 
since exchange of this subdomain alone does not alter the 
glycosylation phenotype determined by the corresponding mutant 
Fuc-TIII and Fuc-TVI chimeras (Fig. 11a , Fuc-TC7; and lib, 
Fuc-TCl2) . Thus, when this observation is considered together 
with other data in Figures 10 and 11a and b, and with the 
results obtained with chimeras Fuc-TC15, Fuc-TC16, Fuc-TC17, 
and Fuc-TC18, it can be supposed that type I acceptor 
substrate discrimination (i.e., a(l,4)Fuc-T activity) can be 
conferred upon a "generic" a(l,3)Fuc-T scaffold, derived from 
either Fuc-TIII or Fuc-TVI, by a set of sequences between the 
NH 2 -tenninal boundary of subdomain 4 and the COOH-terminal 
boundary of subdomain 5 . 

This hypothesis is confirmed by analysis of chimeras in 
which these two subdomains have been transposed together. 
These chimeric a(l,3)Fuc-Ts (Fuc-TC19 and Fuc-TC20) generate 
glycosylation phenotypes (Figs. 12a and b) that correspond 
nearly exactly to the phenotypes generated by chimeras 
constructed by exchange of subdomains 3, 4, and 5 (Figs. 12a 
and b) . Specifically, Fuc-TC19 (subdomains 4 and 5 from Fuc- 
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TVI installed into Fuc-TIII) utilizes type II acceptor 
substrates with an efficiency approximately equivalent to wild 
type Fuc-TVI, but does not determine expression of type I 
molecules (Fig. 12a) . The in vitro properties of this chimera 
are consistent with these flow cytometry results. Chimera 
Puc-TC19 maintains a relatively low apparent K u for the type II 
acceptor (1.1 mM) , a relatively high apparent Michealis-Menten 
constant for the type I acceptor (greater than 50 inM) , and 
displays a high relative specific activity towards the type II 
substrate (Table 3). The chimeric enzyme Fuc-TC19 is 
therefore similar to wild type Fuc-TVI, and to the other "Fuc- 
TVI-like M chimera Fuc-TC4 analyzed in vitro. 

The complementary chimera Fuc-TC20 (subdomains 4 and 5 
from Fuc-TIII transposed into the corresponding position in 
Fuc-TVI ) utilizes neutral and sialylated type I molecules, as 
well as neutral type II molecules, in vivo, to generate I*e a 
(6.3% positive cells), sLe a (3.3% positive cells) and Le* (5.4% 
positive cells) antigens. Western blot analyses indicate that 
Fuc-TC20 accumulates in COS-7 cells to levels nearly 
equivalent to those observed when wild type Fuc-TVI is 
expressed in this cell line. In vitro fucosyltransf erase 
assays demonstrate that this chimeric enzyme maintains an 
inherently low specific activity towards the type I and type 
II acceptor substrates examined (apparent JC n 's exceeding 50 mM; 
Table 3). However, Fuc-TC20 utilizes the type I acceptor 
substrate much more efficiently than it does the type II 
acceptor (Table 3), and thus maintains the in vitro type I 
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acceptor substrate preference exhibited by wild type Fuc-TIII. 
These results are analogous to those obtained with Fuc-TC18, 
above. They indicate that the low level of surface antigen 
expression determined by Fuc-TC20 is due at least in part to 
an inherently diminished general specific activity. However, 
they also indicate that this enzyme, like Puc-TC18 above, has 
acquired a Fuc-TIII-like phenotype by virtue of subdomains 4 
and 5 derived from Fuc-TIII. 

Fuc-TC20 is unusual in that it does not yield cell 
surface sialyl Lewis x expression, but does direct expression 
of the other three fucosylated cell surface antigens (Fig. 
12b) . Given that this chimera differs from Fuc-TCl8 only by 
the absence of a potential asparagine- linked glycosylation 
site within subdomain 3, the observations derived from these 
two chimeras suggested the possibility that display of an 
asparagine- linked oligosaccharide by subdomain 3, at least in 
the context of an otherwise intact Fuc-TVI scaffold, enables 
catalysis of sialyl Lewis x formation, whereas its absence 
disables sialyl Lewis x formation. This possibility is 
supported by the results obtained with chimeras Fuc-TCl and 
Fuc-TC2 (Fig. 10) , which determine a glycosylation phenotype 
analogous to that of Fuc-TC20 (Fig. 12b). In chimeras Fuc-TCl 
and Fuc-TC2, absence of an asparagine-linked oligosaccharide 
modification site in subdomain 3, in the context of a full 
(Fuc-TC2) or partial (Fuc-TCl) COOH-terminal Fuc-TVI scaffold, 
also correlates with an inability to determine cell surface 
sialyl Lewis x formation. However, we note that Fuc-TC12 
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(Fig. lib) , representing a fully wild type Fuc-TVI sequence, 
except for absence of the asparagine-linked glycosylation site 



surface sialyl Lewis x antigen (Fig. lib). Clearly then, the 
unusual phenotype exhibited by chimeras Fuc-TC20, Fuc-TCl, and 
Fuc-TC2, cannot be explained by the simple absence of a single 
asparagine- linked glycosylation site. 

In aggregate, the results obtained with the recombinant 
fucosyltransferase chimeras derived from Fuc-TIII and Fuc-TVI 
indicate that some complement of the 21 non- identical amino 
acid residues encompassed within subdomains 4 and 5 of 
Fuc-TIII and Fuc-TVI (Fig. 9b) can determine whether or not 
these enzymes efficiently utilize type I oligosaccharide 
acceptor substrates in COS-7 cells, and strongly influence 
their ability to discriminate between type I and type II 

acceptors in vitro. 

Subdomains 4 and 5 fr om Fuc-TV leave intact the type I 
substrate utilization activit y of Fuc-TIII - A third human 
ce(l,3)Fuc-T, termed Fuc-TV (Weston, B.W., et al (1992) J. 
Biol. Chem., vol. 267, 4152-4160), shares enzymatic properties 
with both Fuc-TIII and Fuc-TVI, and is very similar in its 
primary sequence to these enzymes* Specifically, Fuc-TV 
determines a cell surface glycosylation phenotype similar to 
that generated by Fuc-TVI when examined in transf ected cell 
lines (Weston, B.W. , et al (1992) J. Biol. Chero., vol. 267, 
4152-4160) , but can also utilize type I acceptor substrates at 
detectable efficiencies, when tested in vitro with low 



in subdomain 3, determines robust expression of the cell 
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molecular weight acceptor substrates (Weston, B.W., et al 
(1992) J. Biol. Chem., vol. 267, 4152-4160; Weston, B. W. , et 
al (1992) J. Biol. Chem., vol. 267, 24575-24584). Its primary 
sequence across subdomains 4 and 5 is equally similar to both 
Fuc-TIII and Fuc-TVI (34 identical residues across spans of 
47; Fig. 13). This enzyme and its properties therefore 
present an opportunity to further define type I substrate 
utilization-determining amino acid residues within subdomains 
4 and 5, by creating and testing a chimera in which subdomains 
4 and 5 from Fuc-TV are exchanged with the corresponding 
domains in Fuc-TIII. The results of these manipulations 
indicate that the robust type I substrate utilization is 
maintained when subdomains 4 and 5 within Fuc-TIII are 
substituted with the corresponding subdomains from Fuc-TV 
(Fig. 12a). These results, when considered together with 
those obtained with the chimeras derived from Fuc-TIII and 
Fuc-TVI, indicate that the type I acceptor substrate 
utilization properties of these enzymes can be controlled by a 
quite limited subset of the 21 non-identical amino acid 
residues suggested to be important by the Fuc-TIII: Fuc-TVI 
chimera experiments (Fig. 13). 
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These experiments demonstrate that utilization of type I 
oligosaccharide substrates by the human 

a(l,3)fucosyltransferases Fuc-TIII, Fuc-TV, and Fuc-TVI is 
determined in large measure by a relatively short peptide 
segment, encompassed by subdomains 4 and 5, and localized to 
the NH 2 -terminus of each enzyme's Golgi lumenal domain. 
Examination of the residues within this segment, in each 
enzyme, suggests the simple possibility that, in the context 
of a "generic" a ( 1 , 3 ) f ucosyltransf erase background, efficient 
type I substrate utilization is allowed by residues shared by 
Fuc-TIII and Fuc-TV and found within subdomains 4 and 5, 
whereas type I substrate utilization is "disabled" by residues 
in this segment that are unique to Fuc-TVI. 

In the context of this simple model, the more obvious 
"Fuc-TVI -unique" amino acid residues (Fig. 13) that might 
operate singly, or in concert with other neighboring residues, 
to disable type I acceptor substrate utilization, include the 
non-conservative differences represented by: (i) the arginine 
residue at position 110 in Fuc-TVI, that corresponds to a 
tryptophan residue that occupies the analogous position in 
both Fuc-TIII and Fuc-TV; (ii) a pair of arginine residues in 
Fuc-TVI at positions (122 and 126) that correspond to a pair 
of proline residues in Fuc-TIII and Fuc-TV; (iii) a tryptophan 
residue (143) in Fuc-TVI that corresponds to glutamine or 
arginine in Fuc-TIII or Fuc-TV, respectively; and (iv) a 
lysine residue at position 146 in Fuc-TVI, that corresponds to 
a glutamic acid residue in Fuc-TIII and Fuc-TV. It is also 
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possible that type I acceptor substrate utilization may be 



differences identified in Figure 13. 

Although these studies have identified candidate residues 
for acceptor substrate discrimination, they do not allow us to 
discern the mechanism (s) through which these residues operate 
in this process. However, previous work by others in this 
area suggests a role for hydrogen bonding interactions in 
acceptor substrate binding and utilization. For example, in 
an evaluation of deoxygenated oligosaccharide acceptor 
substrates for two different human fucosyltransferases 
(corresponding to Fuc-TIII and Fuc-TVT) , Hindsgaul and 
colleagues report that acceptor substrates deoxygenated at the 
reactive hydroxy 1 position do not act as inhibitors of these 
enzymes (Hindsgaul, O., et al (1991) J . Bio. Chem., vol. 266, 
17858-17862). Hindsgaul et. al. interpret these results to 
mean that these fucosyltransferases, as well as other 
glycosyltransf erases, bind to their acceptor substrates, and 
subsequently effect catalysis, through hydrogen bonding 
interactions between the reactive hydroxyl group on the 
acceptor and one or more basic groups in the enzyme 
(Hindsgaul, 0., et al (1991) J. Bio. Chem., vol. 266, 17858- 
17862). In the context of this hypothesis, we note that, in 
aggregate, the M Fuc-TVI-specif ic M residues shown in Figure 13 
are largely basic in character, relative to the corresponding 
residues in Fuc-TIII and Fuc-TV. The non-conserved tryptophan 
residues in subdomains 4 and 5, and other such polar residues, 



influenced by the more conservative amino acid sequence 
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may also mediate f ucosyltransferase acceptor substrate 
recognition/ again presumably through hydrogen bonding 
interactions, as suggested from studies of 

B(l, 4)galactosyltransferase (Aoki, D., et al (1990) EMBOJ., 
vol. 9, 3171-3178). Whether or not these residues operate to 
determine binding to, and fucosylation of, type II acceptor 
substrates exclusively (and perhaps at the expense of type I 
acceptor substrate binding) remains to be experimentally 
determined. 

On a more global scale, it seems probable that 
fucosyltransf erase-specif ic substrate recognition and 
modification properties will also be found to be more than 
simply a function of the ability of the relevant subdomains to 
"bind 11 to the oligosaccharide acceptor. For instance, ve can 
anticipate that some of the residues in these subdomains may 
interact in space with amino acids far away along the linear 
sequence of these enzyme, to alter acceptor substrate 
utilization. Alternatively, residues in these subdomains may 
influence acceptor substrate recognition by residues elsewhere 
in the molecule, via transmitted conformational change 
dictated by the specific residues within these subdomains. 
Finally, it is possible that subdomain-dependent proteolytic 
events indirectly determine acceptor substrate specificity by 
controlling the liberation of distinct soluble 
fucosyltransferase fragments with different acceptor substrate 
specif icit ies . 
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It is emphasized that the present studies have focused on 
detecting the products of these enzymes at a relatively low 
level of resolution (i.e., at the level of cell surface 
antigen expression) . This approach is advantageous in that it 
measures product (s) formed by an enzyme within its correct 
cellular environment. Furthermore, since the fraction of 
antigen-positive cells we observe correlates very well with 
catalytic activities measured in vitro, this parameter seems 
to accurately reflect the biochemical nature of the enzyme 
under study. By contrast, the mean fluorescent intensities of 
the antigen-positive cells fall within a similar range, 
regardless of the enzyme analyzed, and do not correlate with 
iix vitro activity. We do not understand this phenomenon, 
although we speculate that this can be accounted for by the 
possibility that acceptor substrate concentration and/or 
accessibility within COS cells is limiting to the point that 
utilization of the small amounts of the available acceptor 
substrate molecules yields a virtually maximal, though easily 
detectable, signal. The broad spectrum of enzyme protein 
expression level obtained within a population of transiently 
transf ected cells may play upon such limiting substrate levels 
to yield an off -on, or digital signal response that is a 
function of the efficiency of the enzyme under study. For 
example, in a population of cells expressing an enzyme with 
robust inherent activity, many cells, including those that 
express a relatively small amount of enzyme protein , will 
contain sufficient intracellular enzyme activity to convert 
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them to antigen-positivity (hence the relatively larger 
fraction of antigen-positive cells seen with efficient 
enzymes) , even though the limiting amounts of substrate 
operate to place a strict upper boundary on the "brightness" 
of such cells. By contrast, in a population of cells 
expressing an enzyme with a low inherent activity, only the 
relatively small fraction of cells that have accumulated 
relatively large amounts of enzyme protein will contain 
sufficient enzyme activity to exceed the threshold for 
substrate conversion (hence the relatively small fraction of 
antigen positive cells with inefficient enzymes) , and will 
become antigen-positive , with a mean fluorescence intensity 
that can be roughly maximal. Additional work will be required 
to confirm or refute this notion. 

The present approach cannot tell much about the nature of 
the glycoconjugate(s) that displays the fucosylated antigens, 
or about the structures of the glycocon jugate precursors that 
serve as substrates for these enzymes within these cells 
(aside from the fact that some of these moieties terminate 
with neutral and oc2-»3sialylated type I and type II 
structures). It will be important to eventually know this 
information, however, since it is possible that some of the 
apparent subdoma in-dependent acceptor substrate specificity we 
observe is also a function of the type of glycoconjugate that 
displays the acceptor di- or tri-saccharide moieties with 
which we have concerned ourselves here. It is noted that 
there are well known precedents for this type of 
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cis-raodulation of glycosyltransf erase substrate specificity by 
the underlying glycoprotein sequence (Kornfeld, S. (1987) 
FASBB J., vol. l f 462-468; Baenziger, J.U. (1994) F&SEB J., 
vol. 8, 1019-1025) . 

Table 3. In vitro apparent K a 's and relative fucosyltransf erase 
activities of recombinant Fuc-T chimeras. 



Fucosvltrans- 


Aooarent 


Araraarent 


Specific 


f erase 


LacNAc. mM 




Activity Ratio 








LacNAc/LWB-X 


Fuc-TIII 


8.1 


12.7 


0.12 


Fuc-TVX 


4.9 


>50 


2610 


Fuc-TC2 


>50 


>50 


0.03 


FUC-TC4 


5o8 


>50 


3646 


Fuc-TC18 


>50 


>50 


0.05 


FUC-TC19 


1.1 


>50 


189 


FUC-TC20 


>50 


>50 


0.04 
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The structures of the present chimeric anzyaas are shown 



in Table 4 



TABLE 4 



4 mOf9 M DMA 


B*tif*—*F and eomnansnt. amino aci.d ir<2!8 xdu&Q UQQd 




Pue-TIII / 1— 301 t nlua Fue— TVI 1301— 359 1 


FUC"TCa 


Rue— TTTT / 1— 160 1 nlua Fuc— TV! ( 160— 359 \ 


FUC-TC3 


cUC iVA ( i JuU ) p&UD cuw &a2^|«)w£ Wa) 


Fuc-TC4 


FUC— TVI ( 1— 1S9 ) pXUS FUC— TI IX ( lOl— JOl) 


FUC-TC5 


Fuc-TIII (1-33) plus Fuc-TVI ( 34-69 ) plus Fuc-TIII (71-361) 


Fuc-TC6 


Fuc-TIII (1-70) plus Fuc-TVI (70-85) plus Fuc-TIII (87-361) 


FUC-TC7 


Fuc-TIII(l-86) plus Fuc-TVI (86-98) plus Fuc-TIII (100-361) 


Fuc-TC8 


Fuc-TIII (1-99) plu© Fuc-TVI (99-129) plus Fuc-TIII (131-361) 


FUC—TC9 


Fuc-TIII (1-130) plus Fuc-TVI (130-159) plus Fuc-TIII (161-361) 


Fuc-TCIO 


Fuc-TVI (1-33) pluo Fuc-TIII (34-70) pluei Fuc-TVI (70-359) 


Fuc-TCll 


Fuc-TVI ( 1-69 ) plus Fuc-TII (71-86) plu® Fuc-TVI (86-359) 


Fuc-TC12 


Fuc-TVI (1-85) plus Fuc-TIII (87-99) plus Fuc-TVI (99-359) 


PUC-TC13 


Fuc-TVI (1-98) plus Fuc-TIII (100—130) plus Fuc-TVI (130-359) 


Fuc-TC14 


Fuc-TVI ( 1-129 ) plus Fuc-TIII ( 131-160) plus Fuc-TVI (160-359) 


Fuc-TCIS 


Fuc-TIII (1-33) plus Fuc-TVI (34-98) plus Fuc-TIII (100-361 ) 


Puc-TC16 


Fuc-TIII (1-86) pluB Fuc-TVI (86-159) plus Fuc-TIII (161-361) 


Fuc-TC17 


Fuc-TVI (1-33) plus Fuc-TIII (34-99) plus Fuc-TVI ( 99-359 ) 


Fuc-TC18 


Fuc-TVI (1-85) plus Fuc-TIII (87-160) plus Fuc-TVI (160-359) 


FUC-TC19 


Fuc-TIII (1-99) plus Fuc-TVI ( 99-159 ) plus Fuc-TIII (161-361) 


Fuc-TC20 


Fuc-TVI(l-98) plus Fuc-TIII (100-160) plus Fuc-TVI ( 160-359 ) 


Fuc-TC21 


Fuc-TIII (1-99) plus Fuc-TV(116-166) plus Fuc-TIII ( 161-361 ) 



Lyopholized plasaids containing DNA encoding these 
polypeptides were deposited with the ATCC in Rockville, 
Maryland on Septesiber 7, 1995. 
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UoS. Application Serial Number 08/393,246, filed on 
February 23, 1995, is incorporated herein by reference, in its 
entirety. 

Obviously, numerous modifications and variations of the 
present invention are possible in light of the above 
teachings. It is therefore to be understood that, within the 
scope of the appended claims, the invention may be practiced 
otherwise than as specifically described herein,. 



\ 
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SEQUENCE LISTING 



1) GENERAL INFORMATION: 

(i) APPLICANT: LOWE, JOHN B. 

(ii) TITLE OF INVENTION: METHODS AND PRODUCTS FOR THE SYNTHESIS 
OF OLIGOSACCHARIDE STRUCTURES ON GLYCOPROTEINS, 
GLYCOLIFIDS, OR AS FREE MOLECULES, AND FOR THE ISOLATION 
OF CLONED GENETIC SEQUENCES THAT DETERMINE THESE STRUCTU 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: OBLON, SPIVAK, MCCLELLAND, MAIER & NEUSTADT, 

P.C. 

(B) STREET: 1755 Jefferson Davis Highway, Fourth Floor 

(C) CITY: Arlington 

(D) STATE: Virginia 

(E) COUNTRY: U.S.A. 

(F) ZIP: 22202 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 20-JUL-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Lavalleye, Jean-Paul M. P. 

(B) REGISTRATION NUMBER: 31,451 

(C) REFERENCE /DOCKET NUMBER: 2363-060-55 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703)521-4500 

(B) TELEFAX: (703)486-2347 

(C) TELEX: 248855 OPAT UR 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2043 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: cDNA * 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AGGAAACCTG CCATGGCCTC CTGGTGAGCT GTCCTCATCC ACTGCTCGCT GCCTCTCCAG 60 

ATACTCTGAC CCATGGATCC CCTGGGTGCA GCCAAGCCAC AATGGCCATG GCGCCGCTGT 120 

CTGGCCGCAC TGCTATTTCA GCTGCTGGTG GCTGTGTGTT TCTTCTCCTA CCTGCGTGTG 180 

TCCCGAGACG ATGCCACTGG ATCCCCTAGG GCTCCCAGTG GGTCCTCCCG ACAGGACACC 240 

ACTCCCACCC GCCCCACCCT CCTGATCCTG CTATGGACAT GGCCTTTCCA CATCCCTGTG 300 

GCTCTGTCCC GCTGTTCAGA GATGGTGCCC GGCACAGCCG ACTGCCACAT CACTGCCGAC 360 

CGCAAGGTGT ACCCACAGGC AGACACGGTC ATCGTGCACC ACTGGGATAT CATGTCCAAC 420 

CCTAAGTCAC GCCTCCCACC TTCCCCGAGG CCGCAGGGGC AGCGCTGGAT CTGGTTCAAC 480 

TTGGAGCCAC CCCCTAACTG CCAGCACCTG GAAGCCCTGG ACAGATACTT CAATCTCACC 540 

ATGTCCTACC GCAGCGACTC CGACATCTTC ACGCCCTACG GCTGGCTGGA GCCGTGGTCC 600 

GGCCAGCCTG CCCACCCACC GCTCAACCTC TCGGCCAAGA CCGAGCTGGT GGCCTGGGCG 660 

GTGTCCAACT GGAAGCCGGA CTCAGCCAGG GTGCGCTACT ACCAGAGCCT GCAGGCTCAT 720 

CTCAAGGTGG ACGTGTACGG ACGCTCCCAC AAGCCCCTGC CCAAGGGGAC CATGATGGAG 780 

ACGCTGTCCC GGTACAAGTT CTACCTGGCC TTCGAGAACT CCTTGCACCC CGACTACATC 840 

ACCGAGAAGC TGTGGAGGAA CGCCCTGGAG GCCTGGGCCG TGCCCGTGGT GCTGGGCCCC 900 

AGCAGAAGCA ACTACGAGAG GTTCCTGCCA CCCGACGCCT TCATCCACGT GGACGACTTC 960 

CAGAGCCCCA AGGACCTGGC CCGGTACCTG CAGGAGCTGG ACAAGGACCA CGCCCGCTAC 1020 

CTGAGCTACT TTCGCTGGCG GGAGACGCTG CGGCCTCGCT CCTTCAGCTG GGCACTGGAT 1080 

TTCTGCAAGG CCTGCTGGAA ACTGCAGCAG GAATCCAGGT ACCAGACGGT GCGCAGCATA 1140 - 

GCGGCTTGGT TCACCTGAGA GGCCGGCATG GTGCCTGGGC TGCCGGGAAC CTCATCTGCC 1200 

TGGGGCCTCA CCTGCTGGAG TCCTTTGTGG CCAACCCTCT CTCTTACCTG GGACCTCACA 1260 * 
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CGCTGGGCTT 


CACGGCTGCC 


AGGAGCCTCT 


CCCCTCCAGA AGACTTGCCT 


GCTAGGGACC 


1320 


TCGCCTGCTG 


GGGACCTCGC 


CTGTTGGGGA 


CCTCACCTGC 


TGGGGACCTC 


ACCTGCTGGG 


1380 


GACCTTGGCT 


GCTGGAGGCT 


GCACCTACTG 


AGGATGTCGG 


CGGTCGGGGA 


CTTTACCTGC 


1440 


TGGGACCTGC 


TCCCAGAGAC 


CTTGCCACAC 


TGAATCTCAC 


CTGCTGGGGA 


CCTCACCCTG 


1500 


GAGGGCCCTG 


GGCCCTGGGG 


AACTGGCTTA 


CTTGGGGCCC 


CACCCGGGAG 


TGATGGTTCT 


1560 


GGCTGATTTG 


TTTGTGATGT 


TGTTAGCCGC 


CTGTGAGGGG 


TGCAGAGAGA 


TCATCACGGC 


1620 


ACGGTTTCCA 


GATGTAATAC 


TGCAAGGAAA 


AATGATGACG 


TGTCTCCTCA 


CTCTAGAGGG 


1680 


GTTGGTCCCA 


TGGGTTAAGA 


GCTCACCCCA 


GGTTCTCACC 


TCAGGGGTTA 


AGAGCTCAGA 


1740 


GTTCAGACAG 


GTCCAAGTTC 


AAGCCCAGGA 


CCACCACTTA TAGGGTACAG 


GTGGGATCGA 


1800 


CTGTAAATGA 


GGACTTCTGG 


AACATTCCAA 


ATATTCTGGG 


GTTGAGGGAA 


ATTGCTGCTG 


1860 


TCTACAAAAT 


GCCAAGGGTG 


GACAGGCGCT 


GTGGCTCACG 


CCTGTAATTC 


CAGCACTTTG 


1920 


GGAGGCTGAG 


GTAGGAGGAT 


TGATTGAGGC 


CAAGAGTTAA AGACGAGCCT 


GGTCAATATA 


1980 


GCAAGACCAC 


GTCTCTAAAT 


AAAAAATAAT 


AGGCCGGCCA GGAAAAAAAA 


AAAAAAAAAA 


2040 



AAA 2043 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Pro Leu Gly Ala Ala Lys Pro Gin Trp Pro Trp Arg Arg Cys 
15 10 15 

Leu Ala Ala Leu Leu Phe Gin Leu Leu Val Ala Val Cys Phe Phe Ser 

20 25 30 

Tyr Leu Arg Val Ser Arg Asp Asp Ala Thr Gly Ser Pro Arg Ala Pro 
35 40 45 

Ser Gly Ser Ser Arg Gin Asp Thr Thr Pro Thr Arg Pro Thr Leu Leu 
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50 



55 



60 



He Leu Leu Trp Thr Trp Pro Phe His He Pro Val Ala Leu Ser 
65 70 75 



80 



Cys Ser 



Glu Met Val Pro Gly Thr Ala Asp Cys His He Thr Ala Asp 
85 90 95 



Arg Lys Val Tyr 

100 



Gin Ala Asp Thr Val He Val His His Trp 

105 110 



He Met Ser Asn Pro Lys 
115 



Arg Leu Pro Pro 
120 



Pro Arg Pro Gin 
125 



Gly Gin Arg Trp He Trp Phe Asn Leu Glu Pro Pro Pro Asn Cys Gin 
130 135 140 

His Leu Glu Ala Leu Asp Arg Tyr Phe Asn Leu Thr Met Ser Tyr Arg 
145 150 155 160 

Ser Asp Ser Asp He Phe Thr Pro Tyr Gly Trp Leu Glu Pro Trp Ser 

165 170 175 

Gly Gin Pro Ala His Pro Pro Leu Asn Leu Ser Ala Lys Thr Glu Leu 

180 185 190 



Val Ala Trp Ala Val 
195 



Asn Trp Lys Pro Asp 
200 



Ala Arg val 

205 



Tyr Tyr Gin 

210 



Leu Gin Ala His Leu Lys Val Asp Val Tyr Gly Arg 
215 220 



Ser His Lys Pro Leu Pro Lys Gly Thr Met Met Glu Thr Leu Ser Arg 

225 230 235 240 

Tvr Lys Phe Tyr Leu Ala Phe Glu Asn Ser Leu His Pro Asp Tyr He 

245 250 255 

Thr Glu Lys Leu Trp Arg Asn Ala Leu Glu Ala Trp Ala Val Pro Val 

260 265 270 



Val Leu Gly Pro 

275 



Ser Arg Ser Asn Tyr Glu Arg Phe Leu Pro Pro 

280 285 



Ala Phe He His Val Asp Asp Phe Gin 
290 295 



Pro Lys Asp Leu Ala Arg 
300 



Tyr Leu Gin Glu Leu Asp Lys Asp His Ala Arg Tyr Leu 
305 310 315 



Tyr Phe 
320 



Arg Trp Arg Glu Thr Leu Arg Pro Arg Ser Phe Ser Trp Ala Leu Asp 
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325 330 335 

Phe Cys Lys Ala Cys Trp Lys Leu Gin Gin Glu Ser Arg Tyr Gin Thr 

340 345 350 

Val Arg Ser lie Ala Ala Trp Phe Thr 
355 360 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCTTCCCTTG TAGACTCTTC TTGGAATGAG AAGTACCGAT TCTGCTGAAG ACCTCGCGCT 60 

CTCAGGCTCT GGGAGTTGGA ACCCTGTACC TTCCTTTCCT CTGCTGAGCC CTGCCTCCTT 120 

AGGCAGGCCA GAGCTCGACA GAACTCGGTT GCTTTGCTGT TTGCTTTGGA GGGAACACAG 180 

CTGACGATGA GGCTGACTTT GAACTCAAGA GATCTGCTTA CCCCAGTCTC CTGGAATTAA 240 

AGGCCTGTAC TACATTTGCC TGGACCTAAG ATTTTCATGA TCACTATGCT TCAAGATCTC 300 

CATGTCAACA AGATCTCCAT GTCAAGATCC AAGTCAGAAA CAAGTCTTCC ATCCTCAAGA 360 

TCTGGATCAC AGGAGAAAAT AATGAATGTC AAGGGAAAAG TAATCCTGTT GATGCTGATT 420 

GTCTCAACCG TGGTTGTCGT GTTTTGGGAA TATGTCAACA GAATTCCAGA GGTTGGTGAG 480 

AACAGATGGC AGAAGGACTG GTGGTTCCCA AGCTGGTTTA AAAATGGGAC CCACAGTTAT 540 

CAAGAAGACA ACGTAGAAGG ACGGAGAGAA AAGGGTAGAA ATGGAGATCG CATTGAAGAG 600 

CCTCAGCTAT GGGACTGGTT CAATCCAAAG AACCGCCCGG ATGTTTTGAC AGTGACCCCG 660 

TGGAAGGCGC CGATTGTGTG GGAAGGCACT TATGACACAG CTCTGCTGGA AAAGTACTAC 720 

GCCACACAGA AACTCACTGT GGGGCTGACA GTGTTTGCTG TGGGAAAGTA CATTGAGCAT 780 

TACTTAGAAG ACTTTCTGGA GTCTGCTGAC ATGTACTTCA TGGTTGGCCA TCGGGTCATA 840 
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TTTTACGTCA TGATAGACGA CACCTCCCGG ATGCCTGTCG TGCACCTGAA CCCTCTACAT 900 

TCCTTACAAG TCTTTGAGAT CAGGTCTGAG AAGAGGTGGC AGGATATCAG CATGATGCGC 9 SO 

ATGAAGACCA TTGGGGAGCA CATCCTGGCC CACATCCAGC ACGAGGTCGA CTTCCTCTTC 1020 

TGCATGGACG TGGATCAAGT CTTTCAAGAC AACTTCGGGG TGGAAACTCT GGGCCAGCTG 1080 

GTAGCACAGC TCCAGGCCTG GTGGTACAAG GCCAGTCCCG AGAAGTTCAC CTATGAGAGG 1140 

CGGGAACTGT CGGCCGCGTA CATTCCATTC GGAGAGGGGG ATTTTTACTA CCACGCGGCC 1200 

ATTTTTGGAG GAACGCCTAC TCACATTCTC AACCTCACCA GGGAGTGCTT TAAGGGGATC 12 SO 

CTCCAGGACA AGAAACATGA CATAGAAGCC CAGTGGCATG ATGAGAGCCA CCTCAACAAA 1320 

TACTTCCTTT TCAACAAACC CACTAAAATC CTATCTCCAG AGTATTGCTG GGACTATCAG 1330 

ATAGGCCTGC CTTCAGATAT TAAAAGTGTC AAGGTAGCTT GGCAGACAAA AGAGTATAAT 1440 

TTGGTTAGAA ATAATGTCTG ACTTCAAATT GTGATGGAAA CTTGACACTA TTTCTAACCA 1500 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 394 amino acids 
(3) TYPEs asaino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(Si) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met He Thr Met Leu Gin Asp Leu His Val Asn Lys He Ser Met Star 
1 5 10 15 

Arg Ser Lys Ser Glu Thr Ser Leu Pro Ser Ser Arg Ser Gly Ser Gin 

20 25 30 

Glu Lys He Met Asn Val Lys Gly Lys Val He Leu Leu Met Leu XI© 

35 40 45 

Val Ser Thr Val Val Val Val Phe Trp Glu Tyr Val Asn Arg He Pro 
50 55 60 

Glu Val Gly Glu Asn Arg Trp Gin Lys Asp Trp Trp Phe Pro Ser Trp 

65 70 75 80 
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Phe Lys Ash Gly 



Arg Glu Lys Gly 

aoo 

■v 

Asp Trp Phe Asn 
115 

Trp Lys Ala Pro 
130 

Glu Lys Tyr Tyr 
145 

Ala Val Gly Lys 



Ala Asp Met Tyr 

180 

lie Asp Asp Thr 
195 

Ser Leu Gin Val 
210 

Ser Met Met Arg 

225 

Gin His Glu Val 



Gin Asp Asn Phe 

260 

Gin Ala Trp Trp 
275 

Arg Glu Leu Ser 
290 

Tyr His Ala Ala 
305 

Thr Arg Glu Cys 



Glu Ala Gin Trp 

340 



Thr His Ser Tyr 
85 

Arg Asn Gly Asp 



Pro Lys Asn Arg 

120 

lie Val Trp Glu 
135 

Ala Thr Gin Lys 
150 

Tyr lie Glu His 
165 

Phe Met Val Gly 



Ser Arg Met Pro 

200 

Phe Glu lie Arg 
215 

Met Lys Thr He 

230 

Asp Phe Leu Phe 
245 

Gly Val Glu Thr 



Tyr Lys Ala Ser 

280 

Ala Ala Tyr He 
295 

He Phe Gly Gly 
310 

Phe Lys Gly He 
325 

His Asp Glu Ser 



Gin Glu Asp Asn 
90 

Arg He Glu Glu 
105 

Pro Asp Val Leu 



Gly Thr Tyr Asp 

140 

Leu Thr Val Gly 
155 

Tyr Leu Glu Asp 
170 

His Arg Val He 
185 

Val Val His Leu 



Ser Glu Lys Arg 

220 

Gly Glu His He 
235 

Cys Met Asp Val 
250 

Leu Gly Gin Leu 
265 

Pro Glu Lys Phe 



Pro Phe Gly Glu 

300 

Thr Pro Thr His 

315 

Leu Gin Asp Lys 
330 

His Leu Asn Lys 
345 



Val Glu Gly Arg 
95 

Pro Gin Leu Trp 
110 

Thr Val Thr Pro 
125 

Thr Ala Leu Leu 



Leu Thr Val Phe 

160 

Phe Leu Glu Ser 
175 

Phe Tyr Val Met 
190 

Asn Pro Leu His 
205 

Trp Gin Asp lie 



Leu Ala His He 

240 

Asp Gin Val Phe 
255 

Val Ala Gin Leu 
270 

Thr Tyr Glu Arg 
285 

Gly Asp Phe Tyr 



He Leu Asn Leu 

320 

Lys His Asp He 
335 

Tyr Phe Leu Phe 
350 



WO 97/09421 





CT/US96/13RI6 



-274- 



Asn Lys Pro Thr Lys He Leu Ser Pro Glu Tyr cys Trp Asp Tyr Gin 

355 360 365 

He Gly Leu Pro Ser Asp He Lys Ser Val Lys Val Ala Trp Gin Thr 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GAATTCCATC GTGGCAAGGG CAGCCTGAAT GGATGATGTA ACCTGGGGTC CTTTCAATGG 
AGGGCCAGAC TCCTGGGTCT AGGGGATGAG GGAGGGGAGG ATCGGGTTAG CTGGGACCCA 
GGTGAAAGGG GCTGGGGGCC CACATTCCTG AGTCTCAGAG AGAAGGATCT GGGGTCTCAA 
GCACCTGAGT CGGAGGGAGG AGGGGTGCTG GGCTCCTGGA AAAACCACCT CTTGGACCAT 
CTATGCAGAT CACGCAGAAC AAGAGAAATT TCTGCGCCCC ATCTGAATTT CTAAGTTTGG 
GGGGAGGGCG TGATCTGACA CTGAGGTTCC TTGATCCTCA GCAAGGCGGC AATTGCTGTA 
TGAAAGAAGC GACCGCATCT GAGACACAAG TATCCTGCCT TGGAAGCCTC TCACCTGGCC 
GTGGGCCAAC CTCAACCTCA TCTGTCCCTG CTCAGATGCT CAGACCCTGG ACATCCCAGC 
CTCCTCCTCC CTGATGCAAT CCTGGTGTTT CTTTCACCAG AGAAGCCATC CCAGGCCCAG 
GCAGGTGCTC CTGAAATAAC CTGGGGGGAG GGGTGGCTGA AAGTCCCTGA CTGGAGTTGG 
CAGCCAAGCC AGGCCCTGGA GTGGGCACCC AGAGGGAAGA CAGGTTGGCT AATTTCCTGG 
AGCCCCTAAG GGTGCAAGGG TAGGCCTTCT GTGTCTGAGG GAGGAGGGCT GGGGCTCTGG 
ACTCCTGGGT CTGAGGGAGG AGGGGTGGGG GGCCTGGACT CCTGGGTCTG AGGGAGGAGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



370 



375 



380 



Lys Glu Tyr Asn Leu Val Arg Asn Asn Val 



385 390 
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GTCTGGGCCT 


GTACTCCTGG 


ATCTGAGGGA 


GGAGGGGCTG 


GGGAACTTGG 


GCTCCTGGGT 


CTGAGGGAGG 


AGGGAGCTTT 


GGTCTGGACT 


CCTGGGTCTG 


AGGGAGTAGG 


GGCTAGGGAT 


CTGGACTCGT 


GGGTGTGAGG 


AAGGAGGGGC 


TGGGGTCCTG 


GACTCCTGGG 


TCTGAGGAAG 


GAGGGGCAGG 


GGGC7TGGAC 


TCCTGGGTCT 


GAGGAAGGAG 


GGGCCGGGAG 


CCTGGACTCC 


TAAGTCTGAG 


GGAGGAGGGT 


CTGGGGGCCT 


GGACTGCTGG 


GTGTGAGCAG 


AAGGGTCTGG 


GTGCTGGGAG 


TCCCGAGCCT 


GGGGAGATGA 


TGGTTAAACT 


TCTGGGAATC 


AAGTCAAACT 


CCTGAGTCTT 


TGACATTGAT 


GTATCTTGAA 


TGGGAGGGTC AGTCTGTGGG 


GAAGGATTAC 


CCAGGTGCCG 


AGGCAAGAGA 


CTGAAGGCAC 


AAACTGTTTC AGTATAATAA 


AGAAAATAGT 


TAGAATAAGA 


ATAGTTATCA 


TACAAATTAG 


ATATAGAGAT 


GATCATGGAC 


AGTATCAATC 



ATTAGTGTAA ACATTATTAA TCATTAGCTA TTACTTTTAT T CTTT GTTGT ATAACTAATA 



TAACCAGGAA ACAACCGGTG GGTATAGGGT CAGGTACTGA AGGGACATTG TGAGAAGTGA 
CCTAGAAGGC AAGAGGTGAG CCTTCTGTCA CACCGGCATA AGGGCCTCTT GAGGGCTCCT 
TGGTCAAGCG GGAACGCCAG TGTCTGGGAA GGCACCCGTT ACTCAGGAGA CCACGAAAGG 
GAATCTCCTT TTCTTGGAGG AGTGAGGGAA CACTCTGCTC CACCAGCTTC TTGTGGGAGG 
CTGGGTATTA TCTAGGCCTG CCCGCAGTCA TCCTGCTGTG CTGTGCTTCA ATGGTCACGC 
TCCTTGTCCT CTTGCATTTT CCTCCCGTAC TCCTGGTTCC TCTTTGAAGT TCGTAGTAGA 
TAGCGGTAGA AGAAATAGTG AAAGCCTTTT TTTTTTTTTT TTTGAGGCGG AGTCTCGCTC 
TGTCCCCCAG GCTGGAGTGC AGTGGCGTGA TCTCGGCTCA CTGCAATCTC CGCCTCCTGG 
GTTCACACCA TTCTCCTGCC TCACCCTCCC AAATAGCTAG GACTACAGGC GCCCTCCACC 
ACGCGCCCGG ATAATTTTTT GTATTTTTAG TAGAGACAGG GTTTCACCGT GTTAGCCAGG 
ATGGCCTCCA CCTCCTGACC TTGTGATCCG CCCGCCTCAG CCTCCCAAAG TGCTGGGATT 
ACAGGCGTGA GCCACCGCGC CCGCCCGAAA TAGTGAAAGT CTTAAAGTCT TTGATCTTTC 
TTATAAGTGC AGAGAAGAAA ACGCTGACAT ATGCTGCCTT CTCTTTCTGC TTCGGCTGCC 
TAAAAGGGAA GGGCCCCCTG TCCCATGATC ACGTGACTTG CTTGACCTTA TCAGTCATTT 
GGACGACTCA CCCTCCTTAT CCTGCCCCCC CTTGTCTTGT ATACAATAAA TATCAGCGCG 
CCCAGCCATT CGGGGCCACT ACCGGTCTCT GCGTCTTGAT GGTAGTGGTC CCCCGGGCCC 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 

1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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AGCTGTTTTC TCTTTATCTC TTTGTCTTGT GTCTTTATTT CTTACAATCT CTCCTCTCCT 2400 

CACAGGGGAA GAACACCCAC CCGCAAAGCC CCGTAGGGCT GGACCCTACG TTAGCCTGCC 2460 # 

CTGCTCGGGG TTGGCGATGC TGGAGGTGGG CCTTGGACCA GAGAAAATGC TTTAATTAGG 2520 

TGACAAGCGG GCAGAGGCCT TTGTCTCTGG CGCCGGCAGC CACGGCCCCC GCTGACGGCG 2580 

TGGGAAACAG ACCCTGTTCC ACTCCGGTCT CCAGCCTTGG AATGGTTGCC TTCGTGCAGT 2640 

GCAGGTCTGG AAAGTAGCAG TTTGGCACGG GACCCTAGAA TTCCCCAAAA GGAGTGACTA 2700 

GGGGCTGGGA TTCTGGAATT TGAGTGTGGA CGGTGAGGCG GGGGGTGTGG GAGATCGGAG 2760 

ACCCTGGTGG GCGCGGGAGC ACCTGCAGGC TGGAGGCCCT CGCGCGCTCC GGCGGCAGCC 2820 

TGGGAAACAG GTTCTCCATC CCCCAGGAGG ACGCGGCAGA GGGCGGACGA TCGCTCCACT 2880 

CGCCGGGACC AGGTGCGGGG GCCCTGCCCA GCCGCTGGGG CGTGGCCAGG CTCGAAGCAC 2940 

CCAGGTGTCG GGGGCCGACT CTAAGCCCTG GCACCGGAAG AGAGAGGGCG GCGGATTGGA 3000 

CCTCCCGGCT CCAGCATTGC AACTGGGCGC TCCGTCTCCT GGTCCACGCA ATGATGCTGC 3060 

GGCTGCTCAG AAGCCAGGTA GCCTGCCCTG GGTGAAGCCT TCGCGCAGGT CAATGACGGG 3120 

GCGGAGGGGC AGGGCGCGGT CCCCTGCATC CCCGATCTGG GGAGCGGTGG GCCCAGGGGC 3180 

CATCGCCTTA GCCCCTGGCG CTGGGGCTCG GCGCCAAGTG ACGGGCGGGG CTCCACCTTC 3240 

CAGCCATCCG CCCGGCCCGG GAGGGCGGAC GCTGCGAGAC TCCCGGCCGC GCCCTCTCCT 3300 

TCCTCTCCTC CCCAAGCCCT CGCTGCCAGT CCGGACAGGC TGCGCGGAfiG GGAGGGCTGC 3360 

CGGGCCGGAT AGCCGGACGC CTGGCGTTCC AGGGGCGGCC GGATGTGGCC TGCCTTTGCG 3420 

GAGGGTGCGC TCCGGCCACG AAAAGCGGAC TGTGGATCTG CCACCTGCAA GCAGCTCGGG 3480 

TAAGTGGGGA CTGCCCCACT CAGTTGTTCC TGGGACCCAG GAACAACTCC TTCAGAACCA 3540 

GGAGGTGCAC CCCCAACCTC TTCTCCAGGT CTTCCTAAGG CCCTAGGAAT CTCCGCCACC 3600 

TCCCCAGCCA TTACTCCTCC AGGAACCAAG ATGCTCCTTC CGCTCCTGAC CCTCCAGCCT 3660 

CTCTTGTTTT ACTTGAACTA TCGTTTCCCA TCACCACCTC TGTGGTGGAT TTTGCGCCTC 3720 

ACAGACAGGT ACTCCTGAGA AACAGGCTGG TGGAAGAGTC CAGTATCAGC GGAACTTACA 3780 m 

GGAGGGGAGA CTCGAGATTC CTTCAGGAAA GGTGTAGGAA CCTGGACCAC TTTCTTTTTT 3840 

fppjuprpfpijrtpfjrtp XTTTTTTAAG ACAGGGTCCC TCTCTGTCGC GCAAGCTGGA GTGCAGTCAG 3900 « 
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C6GTGCTATC 


GCGGCTCATT 


GTGAGCTCCG GGGATCCTCC CGCCTTAGCA 


TCCGGTGTAG 


3960 


CTGAGACCAC 


AGACATGTGC 


CACCATGCCA AGCTAATTTT ATTTATTTTT 


TTTTGGAGAC 


4020 


GGAGTTTCAC 


TCTTGTTGCC 


CAGGCTGGAG TGTAATGGCA TGATCTCAGC 


TCACCGCAAC 


4080 


TCCCGCCCCC 


CGGGTTCAGG 


CGATTCTCCT GCCTCAGCCT CCCGAGTGGC 


TGGGATTACA 


4140 


GGCATGCGCC 


ACCATGCCCG 


GCTAATTTTG TATTTTAAGT AGAGACAGGG 


TTTCTCCACG 


4200 


TTGGTCAGGC 


TGGTCTCGAA 


CTCCCAACCT CAGGTGATCC ACCCACCTTG 


GCCTCCCAAA 


4260 


GTGCTGGGAT 


TACAGGTGTG 


AGCCACCGCG CCTGGCCCAT GCCAAGCTAA 


TTTTAAAATT 


4320 


TTTTTGTAAG 


AGTGCTCTGT 


TGCCCAGGCT GATCTTGAAC TCCTGGGCTC 


AAGGGATCCT 


4380 


CCCATCTCAG 


CCTCCCAATA 


TGCTGGGATT AGAGGTGTGA GCCACAGTGC 


CCAGCCAAAC 


4440 


CATGGCTATC 


TTGAAAACCA 


CTTGTCTTCC AGTCCCCATG CCCCGAAATT 


CCAAGGCTCT 


4500 


CATCCCTGAA 


ACCTAGGACT 


CAGGCTCTCC CTACCTCAGC CCCAGGAGTC 


TAAACCTTTA 


4560 


ACTTCCTCTT 


TCCCTGGGAC 


TAAGGAGTGC TGCACCCCAG GCGCCTCCCT 


TACCCCACAT 


4620 


CCCTCCTCAG 


CCTCCCCTCC 


TCAGCCTCAG TGCATTTGCT AATTCGCCTT 


TCCTCCCCTG 


4680 


CAGCCATGTG 


GCTCCGGAGC 


CATCGTCAGC TCTGCCTGGC CTTCCTGCTA 


GTCTGTGTCC 


4740 


TCTCTGTAAT 


CTTCTTCCTC 


CATATCCATC AAGACAGCTT TCCACATGGC 


CTAGGCCTGT 


4800 


CGATCCTGTG 


TCCAGACCGC 


CGCCTGGTGA CACCCCCAGT GGCCATCTTC 


TGCCTGCCGG 


4860 


GTACTGCGAT 


GGGCCCCAAC 


GCCTCCTCTT CCTGTCCCCA GCACCCTGCT 


TCCCTCTCCG 


4920 


GCACCTGGAC 


TGTCTACCCC 


AATGGCCGGT TTGGTAATCA GATGGGACAG 


TATGCCACGC 


4980 


TGCTGGCTCT 


GGCCCAGCTC 


AACGGCCGCC GGGCCTTTAT CCTGCCTGCC 


ATGCATGCCG 


5040 




GGTATTCCGC 


ATCACCCTGC CCGTGCTGGC CCCAGAAGTG 


GAGAGCCGCA 


5100 

W 


CGCCGTGGCG 


GGAGCTGCAG 


CTTCACGACT GGATGTCGGA GGAGTACGCG 


GACTTGAGAG 


5160 


ATCCTTTCCT 


GAAGCTCTCT 


GGCTTCCCCT GCTCTTGGAC TTTCTTCCAC 


CATCTCCGGG 


5220 


AACAGATCCG 


CAGAGAGTTC 


ACCCTGCACG ACCACCTTCG GGAAGAGGCG 


CAGAGTGTGC 


5280 


TGGGTCAGCT 


CCGCCTGGGC 


CGCACAGGGG ACCGCCCGCG CACCTTTGTC 


GGCGTCGACG 


5340 


TGCGCCGTGG 


GGACTATCTG 


CAGGTTATGC CTCAGCGCTG GAAGGGTGTG 


GTGGGCGACA 


5400 


GCGCCTACCT 


CCGGCAGGCC 


ATGGACTGGT TCCGGGCACG GCACGAAGCC 


CCCGTTTTCG 


5460 
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TGGTCACCAG CAACGGCATG GAGTGGTGTA AAGAAAACAT CGACACCTCC CAGGGCGATG 5520 

TGACGTTTGC TGGCGATGGA CAGGAGGCTA CACCGTGGAA AGACTTTGCC CTGCTCACAC 5580 t 

AGTGCAACCA CACCATTATG ACCATTGGCA CCTTCGGCTT CTGGGCTGCC TACCTGGCTG 5640 

GCGGAGACAC TGTCTACCTG GCCAACTTCA CCCTGCCAGA CTCTGAGTTC CTGAAGATCT 5700 

TTAAGCCGGA GGCGGCCTTC CTGCCCGAGT GGGTGGGCAT TAATGCAGAC TTGTCTCCAC 5760 

TCTGGACATT GGCTAAGCCT TGAGAGCCAG GGAGACTTTC TGAAGTAGCC TGATCTTTCT 5820 

AGAGCCAGCA GTACGTGGCT TCAGAGGCCT GGCATCTTCT GGAGAAGCTT GTGGTGTTCC 5880 

TGAAGCAAAT GGGTGCCCGT ATCCAGAGTG ATTCTAGTTG GGAGAGTTGG AGAGAAGGGG 5940 

GACGTTTCTG GAACTGTCTG AATATTCTAG AACTAGCAAA ACATCTTTTC CTGATGGCTG 6000 

GCACGCAGTT CTAGAAGCCA CAGTGCCCAC CTGCTCTTCC CAGCCCATAT CTACAGTACT 6060 

TCCAGATGGC TGCCCCCAGG AATGGGGAAC TCTCCCTCTG GTCTACTCTA GAAGAGGGGT 6120 

TACTTCTCCC CTGGGTCCTC CAAAGACTGA AGGAGCATAT GATTGCTCCA GAGCAAGCAT 6180 

TCACCAAGTC CCCTTCTGTG TTTCTGGAGT GATTCTAGAG GGAGACTTGT TCTAGAGAGG 6240 

ACCAGGTTTG ATGCCTGTGA AGAACCCTGC AGGGCCCTTA TGGACAGGAT GGGGTTCTGG 6300 

AAATCCAGAT AACTAAGGTG AAGAATCTTT TTAGTTTTTT TTTTTTTTTT TTGGAGACAG 6360 

GGTCTCGCTC TGTTGCCCAG GCTGGAGTGC AGTGGCGTGA TCTTGGCTCA CTGCAACTTC 6420 

CGCCTCCTGT GTTCAAGCGA TTCTCCTGTC TCAGCCTCCT GAGTAGATGG GACTACAGGC 6480 

ACAGGCCATT ATGCCTGGCT AATTTTTGTA TTTTTAGTAG AGACAGGGTT TCACCATGTT 6540 

GGCCGGGATG GTCTCGATCT CCTGACCTTG TCATCCACCT GTCTTGGCCT CCCAAAGTGC 6600 

TGGGATTACT GGCATGAGCC ACTGTGCCCA GCCCGGATAT TTTTTTTTAA TTATTTATTT 6660 

ATTTATTTAT TTATTGAGAC GGAGTCTTGC TCTGTAGCCC AGGCCAGAGT GCAGTGGCGC 6720 

GATCTCAGCT CACTGCAAGC TCTGCCTCCC GGGTTCATGC CATTCTGCCT CAGCCTCCTG 6780 

AGTAGCTGGG ACTACAGGCG CCCGCCACCA CGCCCGGCTA ATTTTTTTTG TATTTTTAGT 6840 

AGAGACGGGG TTTCATCGTG TTAACCAGGA TGGTCTCGAT CTCCTGACCT CGTGATCTGC 6900 . 

CCACCTCGGC CTCCCACAGT GCTGGGATTA CCGGCGTGAG CCACCATGCC TGGCCCGGAT 6960 

AATTTTTTTT AATTTTTGTA GAGACGAGGT CTTGTGATAT TGCCCAGGCT GTTCTTCAAC 7020 
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TCCTGGGCTC AAGCAGTCCT 


CCCACCTTGG CCTCCCAGAA TGCTGGGTTT ATAGATGTGA 


7080 


GCCAGCACAC 


CGGGCCAAGT 


GAAGAATCTA ATGAATGTGC AACCTAATTG TAGCATCTAA 


7140 


TGAATGTTCC ACCATTGCTG 


GAAAAATTGA GATGGAAAAC AAACCATCTC TAGTTGGCCA 


7200 


GCGTCTTGCT 


CTGTTCACAG 


TCTCTGGAAA AGCTGGGGTA GTTGGTGAGC AGAGCGGGAC 


7260 


TCTGTCCAAC AAGCCCCACA 


GCCCCTCAAA GACTTTTTTT TGTTTGTTTT GAGCAGACAG 


7320 


GCTAAAATGT 


GAACGTGGGG 


TGAGGGATCA CTGCCAAAAT GGTACAGCTT CTGGAGCAGA 


7380 


ACTTTCCAGG 


GATCCAGGGA 


GACTTTTTTT TAAAGCTCAT AAACTGCCAA GAGCTCCATA 


7440 


TATTGGGTGT 


GAGTTCAGGT 


TGCCTCTCAC AATGAAGGAA GTTGGTCTTT GTCTGCAGGT 


7500 


GGGCTGCTGA 


GGGTCTGGGA 


TCTGTTTTCT GGAAGTGTGC AGGTATAAAC ACACCCTCTG 


7560 


TGCTTGTGAC 


AAACTGGCAG 


GTACCGTGCT CATTGCTAAC CACTGTCTGT CCCTGAACTC 


7620 


CCAGAACCAC 


TACATCTGGC 


TTTGGGCAGG TCTGAGATAA AACGATCTAA AGGTAGGCAG 


7680 


ACCCTGGACC 


CAGCCTCAGA 


TCCAGGCAGG AGCACGAGGT CTGGCCAAGG TGGACGGGGT 


7740 


TGTCGAGATC 


TCAGGAGCCC 


CTTGCTGTTT TTTGGAGGGT GAAAGAAGAA ACCTTAAACA 


7800 


TAGTCAGCTC 


TGATCACATC 


r*r*r+n*r*rvr % rT*'& r* Tf&T^flifilkf rrriTftrrTfl TlflflfPTATP 


7860 


AGGGAGTTAC 


AGTTACAATT 


GTTACAGTAC TGTTCCCAAC TCAGCTGCCA CGGGTGAGAG 


7920 


AGCAGGAGGT 


ATGAATTAAA 


AGTCTACAGC ACTAACCCGT GTCTCTGTAG CTTTTTTGGA 


7980 


GCCAGAGCCA 


CTGTGTATGT 


GTGTGTGGGT TTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 


8040 


AAGAGAGTGG AGGAAAAGGT 


GGGGTACTTC TGAAGACTTT TATTTTTTTT TAATTAATTT 


8100 


ATTTTTTTTC AGAGATCGAG 


TCTTGCTCTG TGGCCCAGGC TGGAGTGCAG TAGTGTGATC 


8160 


TCGGCCCACT 


GCAA 




8174 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



T 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

- , Q __ His Ara Gin Leu Cys Leu Ala Phe Leu Leu Val 

net Trp Leu Arg Ser mis Arg »»■»•«• w " * 15 



1 5 



Cys val Leu Ser Val lie Phe Phe Leu His He His Gin Asp Ser Phe 



20 25 



Pro His Gly Leu Gly Leu Ser lie Leu Cys Pro Asp Arg Arg Leu Val 

35 40 
Thr Pro Pro Val Ala lie Phe Cys Leu Pro Gly Thr Ala Met Gly Pro 
50 55 

^ _ _„ ti -; c- pro Ala Ser Leu Ser Gly Thr 
Asn Ala Ser Ser Ser Cys Pro Gin His Pro Aia ser gQ 

65 70 

Trp Thr V.1 Tyr Fro Asn Gly Ar, Phe Gly Asn Gin Met Gly Gin Tyr 

85 

Ala Thr Leu Leu Ala Leu Ala Gin Leu Asn Gly Arg Arg Ala Phe lie 

100 10S> 

Leu Pro Ala Met His Ala Ala Leu Ala Pro Val Phe Arg He Thr Leu 



115 



120 



Pro Val Leu Ala Pro Glu Val Asp Ser Arg Thr Pro Trp Arg Glu Leu 



130 



135 



Gin Leu His Asp Trp Met Ser Glu Glu Tyr Ala Asp Leu Arg Asp Pro 



145 



150 



Phe Leu Lys Leu Ser Gly Phe Pro Cys Ser Trp Thr Phe Phe His His 



165 



Leu Arg Glu Gin lie Arg Arg Glu Phe Thr Leu His Asp His Leu Arg 



180 185 



Glu Glu Ala Gin ser Val Leu Gly Gin Leu Arg Leu Gly Arg Thr Gly 

195 200 

Asp Arg Pro Ar, Thr P»e val Gly Val Hi. Val Ar, Arg Gly Asp Tyr 



210 215 



Leu Gin Val Met Pro Gin Arg Trp Lys Gly Val Val Gly Asp Ser Ala 



225 



230 



Tyr Lau Arg Gin Ala Met Asp Trp Phe Arg Ala Arg His Glu Ala Pro 

245 



Val Phe Val Val Thr Ser Asn Gly Met Glu Trp Cys Lys Glu Asn lie 
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260 265 270 

Asp Thr Ser Gin Gly Asp Val Thr Phe Ala Gly Asp Gly Gin Glu Ala 
275 280 285 

Thr Pro Trp Lys Asp Phe Ala Leu Leu Thr Gin Cys Asn His Thr lie 

290 295 300 

Met Thr lie Gly Thr Phe Gly Phe Trp Ala Ala Tyr Leu Ala Gly Gly 
305 310 315 320 

Asp Thr Val Tyr Leu Ala Asn Phe Thr Leu Pro Asp Ser Glu Phe Leu 

325 330 335 

Lys lie Phe Lys Pro Glu Ala Ala Phe Leu Pro Glu Trp Val Gly lie 

340 345 350 

Asn Ala Asp Leu Ser Pro Leu Trp Thr Leu Ala Lys Pro 
355 360 365 

(2) INFORMATION FOR SEQ ID NO: 7: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANT I -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTGCAGAGAG CGCCACCCGG AAGCCACTTT TATAGAAGCT TTTACACACA ATGCTTGATT 60 

TTTTTTTTTT TTTTCCGAGA CGGAGTCTCG CTTTGTCGCC CAGGCTGGAG TGCAGTGGCG 120 

CGATCTGGGC TCACTGCAAG CTCCGCCTCC TGGGTTGACG CCATTCTCCT GCCTCAGCTT 180 

CCCGAGTAGC TGGGACTACA GGCGCCCGCC ACCAAGCCTG GCTAATTTTT TTTTATTTTT 240 

AGTGGAGACA GAGTTTCACC GTGTTAGCCA GGATGGTCTC GATCTCCTGA CCTCGGGATC 300 

CGCCCGCCTC GGCCTCCCAA AGTGCTGGGA GTATAGGCGT GAGCCACCGC GCCTGGCCTA 360 

TACTTGATTT TTAATGAAAA CATTCTTAAA TTCATATGGC TAACGCAAAT TTATTTTCTG 420 
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TAGGCATAAC ATCAAAAACA CCTGGCAGGA CTGCCCCATT CCCAGCACTG TCTAGTTCTC 
CCCTAGTATC AGTGGGACTC CACTGATGCA CAGCTGTGAT CTACTAAAAC TTCTCTCAAA 
ACTTTCTCCT CTCCTTAGGT CAGCAGCCCC GCCCCTGATC TATTTGGAAA TCCCCTGAAT 
AAAAGTTGAA TATCATAAAC CAAAGCGAAC ACCCAGAAAT TCAAATTCAA CCCGTAGGTA 
AAAAATTTCT CAAGTGACTG TAGACGTAGA TGTCTCCAGT GTCGCCTAAT AAGGTAGAAG 
AGGCCAGTGC GATACTGTCT TTACACCCTT AACTTGGGTG CTAGAATATT TATCTTCGTC 
ATCATTTTAT CATCCAAACT ATTTTGCATA ACTTTCATGG GTGCAGAAAA TGTTTTTTAA 
GTGCTTGGTA AAATTAATAG TGATATTCAT TCATTCATCT CACTGAACAG GCAATAAATT 
CCTTGACGAC AAGGGCCTTG GGGGGGGCCA CATCTTCATC TTTGGTTTAT GAGTCCTGTG 
CGTCTTGGTA CAAGCAATAC TACTATGAGC CGGCAAGTCA GACTTATTTG GTAGGGGACC 
AAAGGAAAGA ACATGTTTTG ATTGCTAAGA AAACATTTTG TTCTCTATCC TTTACTGGGC 
TGGCAGQCAA AGGAAATGTT CTTATGAGCA CTCACATTGA AAACTTAAGT TCTTCACCAA 
ATGCAGAGAC TCTGAAGGCC ACGCCGCTGC GGGCTGCCTC CACAATTCGA CCGTCTCGGC 
GGGCCACGAG ATCCTGGCCA CGGATGCGGT GGCCGCGCCT CTGCTCGCAC GTTCCCCCGG 
CCTCTGGACT CCCTCCCTCC CTCAATCCCT CCCTCCGGCG GGCGTCGCTG GCGGGTGGCT 
AGGCCCAACG GCAGGAAGCC GACGCTATCC TCCGTTCCGC GGCGCCGGGT CCGCCTTCCG 
TCTGTTCTAG GGCCTGCTCC TGCGCGGCAG CTGCTTTAGA AGGTCTCGAG CCTCCTGTAC 
CTTCCCAGGG ATGAACCGGG CCTTCCCTCT GGAAGGCGAG GGTTCGGGCC ACAGTGAGCG 
AGGGCCAGGG CGGTGGGCGC GCGCAGAGGG AAACCGGATC AGTTGAGAGA GAATCAAGAG 
TAGCGGATGA GGCGCTTGTG GGGCGCGGCC CGGAAGCCCT CGGGCGCGGG CTGGGAGAAG 
GAGTGGGCGG AGGCGCCGCA GGAGGCTCCC GGGGCCTGGT CGGGCCGGCT GGGCCCCGGG 
CGCAGTGGAA GAAAGGGACG GGCGGTGCCC GGTTGGGCGT CCTGGCCAGC TCACCTTGCC 
CTGGCGGCTC GCCCCGCCCG GCACTTGGGA GGAGCAGGGC AGGGCCCGCG GCCTTTGCAT 
TCTGGGACCG CCCCCTTCCA TTCCCGGGCC AGCGGCGAGC GGCAGCGACG GCTGGAGCCG 
CAGCTACAGC ATGAGAGCCG GTGCCGCTCC TCCACGCCTG CGGACGCGTG GCGAGCGGAG 
GCAGCGCTGC CTGTTCGCGC CATGGGGGCA CCGTGGGGCT CGCCGACGGC GGCGGCGGGC 



TCTAGTTCTC 


480 


TTCTCTCAAA 


540 c 


TCCCCTGAAT 


600 


CCCGTAGGTA 


660 


AAGGTAGAAG 


720 


TATCTTCGTC 


780 


TGTTTTTTAA 


840 


GCAATAAATT 


900 


GAGTCCTGTG 


960 


GTAGGGGACC 


1020 


TTTACTGGGC 


1080 


TCTTCACCAA 


1140 


CCGTCTCGGC 


1200 


GTTCCCCCGG 


1260 


GCGGGTGGCT 


1320 


CCGCCTTCCG 


1380 


CCTCCTGTAC 


1440 


ACAGTGAGCG 


1500 


GAATCAAGAG 


1560 


CTGGGAGAAG 


1620 


GGGCCCCGGG 


1680 


TCACCTTGCC 


1740 


GCCTTTGCAT 


1800 


GCTGGAGCCG 


1860 . 


GCGAGCGGAG 


1920 


GGCGGCGGGC 


1980 t 
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t 



GGGCGGCGCG 


GGTGGCGCCG 


AGGCCGGGGG 


CTGCCATGGA CCGTCTGTGT 


GCTGGCGGCC 


2040 


GCCGGCTTGA 


CGTGTACGGC 


GCTGATCACC 


TACGCTTGCT GGGGGCAGCT 


GCCGCCGCTG 


2100 


CCCTGGGCGT 


CGCCAACCCC 


GTCGCGACCG 


GTGGGCGTGC TGCTGTGGTG 


GGAGCCCTTC 


2160 


GGGGGGCGCG ATAGCGCCCC 


GAGGCCGCCC 


CCTGACTGCC CGCTGCGCTT 


CAACATCAGC 


2220 


GGCTGCCGCC TGCTCACCGA 


CCGCGCGTCC 


TACGGAGAGG CTCAGGCCGT 


GCTTTTCCAC 


2280 


CACCGCGACC TCGTGAAGGG 


GCCCCCCGAC 


TGGCCCCCGC CCTGGGGCAT 


CCAGGCGCAC 


2340 


ACTGCCGAGG AGGTGGATCT GCGCGTGTTG 


GACTACGAGG AGGCAGCGGC 


GGCGGGAGAA 


2400 


GCCCTGGCGA 


CCTCCAGCCC 


CAGGCCCCCG 


GGCCAGCGCT GGGTTTGGAT 


GAACTTCGAG 


2460 


TCGCCCTCGC ACTCCCCGGG 


GCTGCGAAGC 


CTGGCAAGTA ACCTCTTCAA 


CTGGACGCTC 


2520 


TCCTACCGGG 


CGGACTCGGA 


CGTCTTTGTG 


CCTTATGGCT ACCTCTACCC 


CAGAAGCCAC 


2580 


CCCGGCGACC 


CGCCCTCAGG 


CCTGGCCCCG 


CCACTGTCCA GGAAACAGGG 


GCTGGTGGCA 


2640 


TGGGTGGTGA GCCACTGGGA CGACCGCGAG 


GCCCGGGTCC GCTACTACCA 


CCAACTGAGC 


2700 


CAACATGTGA CCGTGGACGT 


GTTCGGCCGG 


GGCGGGCCGG GGCAGCCGGT 


GCCCGAAATT 


2760 


GGGCTCCTGC 


ACACAGTGGC 


CCGCTACAAG 


TTCTACCTGG CTTTCGAGAA 


CTCGCAGCAC 


2820 


CTGGATTATA 


TCACCGAGAA 


GCTCTGGCGC 


AACGCGTTGC TCGCTGGGGC 


GGTGCCGGTG 


2880 


6TGCTGGGCC 


CAGACCGTGC 


CAACTACGAG 


GCGTTTGTGC CCCGCGGCGC 


CTTCATCCAC 


2940 


GTGGACGACT 


TCCCAAGTGC 


CTCCTCCCTG 


GCCTCGTACC TGCTTTTCCT 


CGACCGCAAC 


3000 


CCCGCGGTCT 


ATCGCCGCTA 


CTTCCACTGG 


CGCCGGAGCT ACGCTGTCCA 


CATCACCTCC 


3060 


TTCTGGGACG 


AGCCTTGGTG 


CCGGGTGTGC 


CAGGCTGTAC AGAGGGCTGG 


GGACCGGCCC 


3120 


AAGAGCATAC 


GGAACTTGGC 


CAGCTGGTTC 


GAGCGGTGAA GCCGCGCTCC 


CCTGGAAGCG 


3180 


ACCGAGGGGA 


GCCCAAGTTG TCAGCTTTTT 


GATCCTCTAC TGTGCATCTC 


CTTGACTGCC 


3240 


GGATCATGGG 


AGTAAGTTCT 


TCAAACACCC 


ATTTTTGCTC TATGGGAAAA 


AAACGATTTA 


3300 


CCAATTAATA TTACTCAGCA CAGAGATGGG 


GGCCCGGTTT CCATATTTTT 


TGCACAGCTA 


3360 


GCAATTGGGC 


TCCCTTTGCT 


GCTGATGGGC 


ATCATTGTTT AGGGGTGAAG 


GAGGGGGTTC 


3420 


TTCCTCACCT 


TGTAACCAGT 


GCAGAAATGA 


AATAGCTTAG CGGCAAGAAG 


CCGTTGAGGC 


3480 


GGTTTCCTGA ATTTCCCCAT CTGCCACAGG 


CCATATTTGT GGCCCGTGCA 


GCTTCCAAAT 


3540 
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CTCATACACA ACTGTTCCCG ATTCACGTTT TTCTGGACCA AGGTGAAGCA AATTTGTGGT 3600 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 asino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Gly Ala Pro Trp Glv Ser Pro Thr Ala Ala Ala Gly Gly Arg Arg 
1 5 10 15 

Gly Trp Arg Arg Gly Arg Gly Leu Pro Trp Thr Val Cys Val Leu Ala 

20 25 30 

Ala Ala Gly Leu Thr Cys Thr Ala Lou lie Thr Tyr Ala cys Trp Gly 

35 40 45 

Gin Leu Pro Pro Leu Pro Trp Ala Ser Pro Thr Pro Ser Arg Pro Val 

50 55 60 

Gly Val Leu Leu Trp Trp Glu Pro Phe Gly Gly Arg Asp Ser Ala Pro 
65 70* 75 80 

Arg Pro Pro Pro Asp Cvs Pro Leu Arg Phe Asn He Ser Gly Cys Arg 

85 " 90 95 

Leu Leu Thr Asp Arg Ala Ser Tyr Gly Glu Ala Gin Ala Val Leu Phe 

100 105 110 

His His Arg Asp Leu Val Lys Gly Pro Pro Asp Trp Pro Pro Pro Trp 
115 120 125 

Gly He Gin Ala His Thr Ala Glu Glu Val Asp Leu Arg Val Leu Asp 
130 135 140 

Tyr Glu Glu Ala Ala Ala Ala Ala Glu Ala Leu Ala Thr Sar ser Pro 

145 150 155 160 

Arg Pro Pro Gly Gin Arg Trp Val Trp Met Asn Phe Glu Ser Pro Ser 



TGTAGAAGGA GCCTTGTTGG TGGAGAGTGG AAGGACTGTG GCTGCAG 



3647 



165 



170 



175 
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His Ser Pro Gly Leu Arg Ser Leu Ala Ser Asn Leu Phe Asn Trp Thr 

180 185 190 

Leu Ser Tyr Arg Ala Asp Ser Asp Val Phe Val Pro Tyr Gly Tyr Leu 
195 200 205 

Tyr Pro Arg Ser His Pro Gly Asp Pro Pro Ser Gly Leu Ala Pro Pro 
210 215 220 

Leu Ser Arg Lys Gin Gly Leu Val Ala Trp Val Val Ser His Trp Asp 

230 235 240 



Glu Arg Gin Ala Arg Val Arg Tyr Tyr His Gin Leu Ser Gin His Val 

245 250 



Thr Val Asp Val Phe Gly Arg Gly Gly Pro Gly Gin Pro Val Pro Glu 

260 265 270 

lie Gly Leu Leu His Thr Val Ala Arg Tyr Lys Phe Tyr Leu Ala Phe 

275 280 285 

Glu Asn Ser Gin His Leu Asp Tyr lie Thr Glu Lys Leu Trp Arg Asn 

290 295 300 

Ala Leu Leu Ala Gly Ala Val Pro Val Val Leu Gly Pro Asp Arg Ala 

305 310 315 320 

Asn Tyr Glu Arg Phe Val Pro Arg Gly Ala Phe lie His Val Asp Asp 

325 330 



Phe Pro Ser Ala Ser Ser Leu Ala Ser Tyr Leu Leu Phe Leu Asp Arg 

340 345 350 

Asn Pro Ala Val Tyr Arg Arg Tyr Phe His Trp Arg Arg Ser Tyr Ala 

355 360 365 

Val His lie Thr Ser Phe Trp Asp Glu Pro Trp Cys Arg Val Cys Gin 
370 375 380 

Ala Val Gin Arg Ala Gly Asp Arg Pro Lys Ser lie Arg Asn Leu Ala 
385 390 395 400 

Ser Trp Phe Glu Arg 

405 

INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
! ATGGGGGCAC CGTGGGGCTC GCCGACGGCG GCGGCGGGCG GGCGGCGCGG GTGGCGCCGA 
GGCCCGGGGC TGCCATGGAC CGTCTGTGTG CTGGCGGCCG CCGGCTTGAC GTGTACGGCG 
CTGATCACCT ACGCTTGCTG GGGGCAGCTG CCGCCGCTGC CCTGGGCGTC GCCAACCCCG 
TCGCGACCGG TGGGCGTGCT GCTGTGGTGG GAGCCCTTCG GGGGGCGCGA TAGCGCCCCG 
AGGCCGCCCC CTGACTGCTG CTGGGGGCAG CTGCCGCCGC TGCCCTGGGC GTCGCCAACC 
CCGTCGCGAC CGGTGGGCGT GCTGCTGTGG TGGGAGCCCT TCGGGGGGCG CGATAGCGCC 
CCGAGGCCGC CCCCTGACTG CCCGCTGCGC TTCAACATCA GCGGCTGCCG CCTGCTCACC 
GACCGCGCGT CCTACGGAGA GGCTCAGGCC GTGCTTTTCC ACCACCGCGA CCTCGTGAAG 
GGGCCCCCCG ACTGGCCCCC GCCCTGGGGC ATCCAGGCGC ACACTGCCGA GCCGCTGCGC 
TTCAACATCA GCGGCTGCCG CCTGCTCACC GACCGCGCGT CCTACGGAGA GGCTCAGGCC 
GTGCTTTTCC ACCACCGCGA CCTCGTGAAG GGGCCCCCCG ACTGGCCCCC GCCCTGGGGC 
ATCCAGGCGC ACACTGCCGA GGAGGTGGAT CTGCGCGTGT TGGACTACGA GGAGGCAGCG 
GCGGCGGCAG AAGCCCTGGC GACCTCCAGC CCCAGGCCCC CGGGCCAGCG CTGGGTTTGG 
ATGAACTTCG AGTCGCCCTC GCACTCCCCG GGGCTGCGAA GCCTGGCAAG TAACCTCTTC 
AACTGGACGC TCTCCTACCG GGCGGACTCG GACGTCTTTG TGCCTTATGG CTACCTCTAC 
CCCAGAAGCC ACCCCGGCGA CCCGCCCTCA GGCCTGGCCC CGCCACTGTC CAGGAAACAG 
GGGCTGGTGG CATGGGTGGT GAGCCACTGG GACGAGCGCC AGGCCCGGGT CCGCTACTAC 
CACCAACTGA GCCAACATGT GACCGTGGAC GTGTTCGGCC GGGGCGGGCC GGGGCAGCCG 
GTGCCCGAAA TTGGGCTCCT GCACACAGTG GCCCGCTACA AGTTCTACCT GGCTTTCGAG 
AACTCGCAGC ACCTGGATTA TATCACCGAG AAGCTCTGGC GCAACGCGTT GCTCGCTGGG 
GCGGTGCCGG TGGTGCTGGG CCCAGACCGT GCCAACTACG AGCGCTTTGT GCCCCGCGGC 
GCCTTCATCC ACGTGGACGA CTTCCCAAGT GCCTCCTCCC TGGCCTCGTA CCTGCTTTTC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 « 
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CTCGACCGCA ACCCCGCGGT CTATCGCCGC TACTTCCACT GGCGCCGGAG CTACGCTGTC 1380 
CACATCACCT CCTTCTGGGA CGAGCCTTGG TGCCGGGTGT GCCAGGCTGT ACAGAGGGCT 1440 
GGGGACCGGC CCAAGAGCAT ACGGAACTTG GCCAGCTGGT TCGAGCGG 1488 



(2) INFORMATION FOR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TTTATGACAA GCTGTGTCAT AAATTATAAC AGCTTCTCTC AGGACACTGT GGCCAGGAAG 60 

TGGGTGATCT TCCTTAATGA CCCTCACTCC TCTCTCCTCT CTTCCCAGCT ACTCTGACCC 120 

ATGGATCCCC TGGGCCCAGC CAAGCCACAG TGGCTGTGGC GCCGCTGTCT GGCCGGGCTG 180 

CTGTTTCAGC TGCTGGTGGC TGTGTGTTTC TTCTCCTACC TGCGTGTGTC CCGAGACGAT 240 

GCCACTGGAT CCCCTAGGCC AGGGCTTATG GCAGTGGAAC CTGTCACCGG GGCTCCCAAT 300 

GGGTCCCGCT GCCAGGACAG CATGGCGACC CCTGCCCACC CCACCCTACT GATCCTGCTG 360 

TGGACGTGGC CTTTTAACAC ACCCGTGGCT CTGCCCCGCT GCTCAGAGAT GGTGCCCGGC 420 

GCGGCCGACT GCAACATCAC TGCCGACTCC AGTGTGTACC CACAGGCAGA CGCGGTCATC 480 

GTGCACCACT GGGATATCAT GTACAACCCC AGTGCCAACC TCCCGCCCCC CACCAGGCCG 540 

CAGGGGCAGC GCTGGATCTG GTTCAGCATG GAGTCCCCCA GCAACTGCCG GCACCTGGAA 600 

GCCCTGGACG GATACTTCAA TCTCACCATG TCCTACCGCA GCGACTCCGA CATCTTCACG 660 

CCCTACGGCT GGCTGGAGCC GTGGTCCGGC CAGCCTGCCC ACCCACCGCT CAACCTCTCG 720 

GCCAAGACCG AGCTGGTGGC CTGGGCGGTG TCCAACTGGA AGCCGGACTC GGCCAGGGTG 780 

CGCTACTACC AGAGCCTGCA GGCTCATCTC AAGGTGGACG TGTACGGACG CTCCCACAAG 840 

CCCCTGCCCA AGGGGACCAT GATGGAGACG CTGTCCCGGT ACAAGTTCTA TCTGGCCTTC 900 
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GAGAACTCCT TGCACCCCGA CTACATCACC GAGAAGCTGT GGAGGAACGC CCTGGAGGCC 960 

TGGGCCGTGC CCGTGGTGCT GGGCCCCAGC AGAAGCAACT ACGAGAGGTT CCTGCCGCCC 1020 

GACGCCTTCA TCCACGTGGA TGACTTCCAG AGCCCCAAGG ACCTGGCCCG GTACCTGCAG 1080 

GAGCTGGACA AGGACCACGC CCGCTACCTG AGCTACTTTC GCTGGCGGGA GACGCTGCGG 1140 

CCTCGCTCCT TCAGCTGGGC ACTGGCTTTC TGCAAGGCCT GCTGGAAGCT GCAGCAGGAA 1200 

TCCAGGTACC AGACGGTGCG CAGCATAGCG GCTTGGTTCA CCTGAGAGGC CGGCATGGGG 1260 

CCTGGGCTGC CAGGGACCTC ACTTTCCCAG GGCCTCACCT ACCTAGGGTC TCTAGA 1316 
(2) INFORMATION FOR SEQ ID NOsll: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 374 aaino acids 
(3) TYPE: aaino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(2£i) SEQUENCE DESCRIPTIONS SEQ ID NO: 11: 

Met Asp Pro Leu Gly Pro Ala Lys Pro Gin Trp Leu Trp Arg Arg Cys 
15 10 15 

Leu Ala Gly Leu Leu Phe Gin Leu Leu Val Ala Val Cys Phe Ph© S©r 

20 25 30 

Tyr Leu Arg Val Ser Arg Asp Asp Ala Thr Gly Ser Pro Arg Pro Gly 
35 40 45 

Leu Met Ala Val Glu Pro Val Thr Gly Ala Pro Asn Gly Ser Arg cys 
50 55 60 

Gin Asp Ser Met Ala Thr Pro Ala His Pro Thr Leu Leu He Leu Leu 
65 70 75 80 

Trp Thr Trp Pro Phe Asn Thr Pro Val Ala Leu Pro Arg Cys Ser Glu 

85 90 95 

Met Val Pro Gly Ala Ala Asp Cys Asn He Thr Ala Asp Ser Ser Val 

100 105 HO 

Tyr Pro Gin Ala Asp Ala Val He Val His His Trp Asp He Met Tyr 
115 120 125 



WO 97/D942! 




• 



CT/US9fi/fi381!6 



-289 



Asn Pro 
130 



Ala Asn Leu Pro Pro Pro Thr Arg Pro Gin Gly Gin Arg 

135 140 



Trp lie Trp Phe Ser Met Glu Ser Pro Ser 
145 150 



Asn Cys Arg His Leu Glu 
155 160 



Ala Leu Asp Gly Tyr Phe Asn Leu Thr Met 

165 170 



Tyr Arg Ser Asp Ser 

175 



Asp lie Phe Thr Pro Tyr Gly Trp Leu Glu Pro Trp 

180 185 



Gly Gin Pro 
190 



Ala His Pro Pro Leu Asn Leu Ser Ala Lys Thr Glu Leu Val Ala Trp 

195 200 205 

Ala Val Ser Asn Trp Lys Pro Asp Ser Ala Arg Val Arg Tyr Tyr Gin 
210 215 220 



Ser Leu Gin Ala His Leu Lys Val Asp Val Tyr Gly Arg 
225 230 235 



His Lys 
240 



Pro Leu Pro Lys Gly Thr Met Met Glu Thr Leu Ser Arg Tyr Lys Phe 

245 250 



Tyr Leu Ala Phe Glu Asn 

260 



Leu His Pro 
265 



Tyr lie Thr Glu Lys 
270 



Leu Trp Arg Asn Ala Leu Glu Ala Trp Ala Val 

275 230 



Val Val Leu Gly 
285 



Pro Ser Arg 
290 



Asn Tyr Glu 

295 



Phe Leu Pro Pro 

300 



Asp Ala Phe lie 



His Val Asp Asp Phe Gin 
305 310 



Pro Lys Asp Leu Ala Arg Tyr Leu Gin 

315 320 



Glu Leu Asp Lys Asp His Ala Arg Tyr Leu Ser Tyr Phe Arg Trp 

325 330 335 



Glu Thr Leu Arg Pro Arg 

340 



Phe Ser Trp Ala Leu Ala Phe Cys Lys 
345 350 



Ala Cys Trp Lys Leu Gin Gin Glu 
355 360 



Arg Tyr Gin Thr Val Arg Ser 

365 



lie Ala Ala Trp Phe Thr 
370 



INFORMATION FOR SEQ ID NO; 12: 
(i) SEQUENCE CHARACTERISTICS 
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(A) LENGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown • 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ATGGATCCCC TGGGTGCAGC CAAGCCACAA TGGCCATGGC GCCGCTGTCT 


GGCCGCACTG 


60 


CTATTTCAGC TGCTGGTGGC TGTGTGTTTC TTCTCCTACC TGCGTGTGTC 


CCGAGACGAT 


120 


GCCACTGGAT CCCCTAGGGC TCCCAGTGGG TCCTCCCGAC AGGACACCAC 


TCCCACCCGC 


180 


CCCACCCTCC TGATCCTGCT ATGGACATGG CCTTTCCACA TCCCTGTGGC 


TCTGTCCCGC 


240 


TGTTCAGAGA TGGTGCCCGG CACAGCCGAC TGCCACATCA CTGCCGACCG 


CAAGGTGTAC 


300 


CCACAGGCAG ACACGGTCAT CGTGCACCAC TGGGATATCA TGTCCAACCC 

^ 


TAAGTCACGC 


360 


CTCCCACCTT CCCCGAGGCC GCAGGGGCAG CGCTGGATCT GGTTCAACTT 


GGAGCCACCC 


420 


CCTAACTGCC AGCACCTGGA AGCCCTGGAC AGATACTTCA ATCTCACCAT 


GTCCTACCGC 


480 


AGCGACTCCG ACATCTTCAC GCCCTACGGC TGGCTGGAGC CGTGGTCCGG 


CCAGCCTGCC 


540 


CACCCACCGC TCAACCTCTC GGCCAAGACC GAGCTGGTGG CCTGGGCGGT 


GTCCAACTGG 


600 


AAGCCGGACT CAGCCAGGGT GCGCTACTAC CAGAGCCTGC AGGCTCATCT 


CAAGGTGGAC 


660 


GTGTACGGAC GCTCCCACAA GCCCCTGCCC AAGGGGACCA TGATGGAGAC 


GCTGTCCCGG 


720 


TACAAGTTCT ACCTGGCCTT CGAGAACTCC TTGCACCCCG ACTACATCAC 


CGAGAAGCTG 


780 


TGGAGGAACG CCCTGGAGGC CTGGGCCGTG CCCGTGGTGC TGGGCCCCAG 


CAGAAGCAAC 


840 


TACGAGAGGT TCCTGCCACC CGACGCCTTC ATCCACGTGG ACGACTTCCA 


GAGCCCCAAG 


900 


GACCTGGCCC GGTACCTGCA GGAGCTGGAC AAGGACCACG CCCGCTACCT 


GAGCTACTTT 


960 


CGCTGGCGGG AGACGCTGCG GCCTCGCTCC TTCAGCTGGG CACTGGATTT 


CTGCAAGGCC 


1020 


TGCTGGAAAC TGCAGCAGGA ATCCAGGTAC CAGACGGTGC GCAGCATAGC 


GGCTTGGTTC 


1080 


ACCTGA 




1086 



(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1654 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TTTTCTCATC 


TGTGAAACAG 


GAATAATAAC 


AGCTCTTCTC 


AGGACTCATG 


GCCTGGAGCT 


60 


TTGGTAAGCA 


GGAGATTGTC 


ATCAATGACC 


CTCACTCCTC 


TCTCCCCACT 


TCCCAGAGAC 


120 


TCTGACCCAT 


GGATCCCCTG 


GGCCCGGCCA 


AGCCACAGTG 


GTCGTGGCGC 


TGCTGTCTGA 


180 


CCACG CTG CT 


GTTT CAGCTG 


CTGATGG CTG 


TGTGTTTCTT 


CTCCTATCTG 


CGTGTGTCTC 


240 


AAGACGATCC 


CACTGTGTAC 


CCTAATGGGT 


CCCGCTTCCC 


AGACAGCACA 


GGGACCCCCG 


300 


CCCACTCCAT 


CCCCCTGATC 


CTGCTGTGGA 


CGTGGCCTTT 


TAACAAACCC 


ATAGCTCTGC 


360 


CCCGCTGCTC 


AGAGATGGTG 


CCTGGCACGG 


CTGACTGCAA 


CATCACTGCC 


GACCGCAAGG 


420 


TGTATCCACA 


GGCAGACGCG 


GTCATCGTGC 


ACCACCGAGA 


GGTCATGTAC 


AACCCCAGTG 


480 


CCCAGCTCCC 


ACGCTCCCCG 


AGGCGGCAGG 


GGCAGCGATG 


GATCTGGTTC 


AGCATGGAGT 


540 


CCCCAAGCCA 


CTGCTGGCAG 


CTGAAAGCCA 


TGGACGGATA 


CTTCAATCTC 


ACCATGTCCT 


600 


ACCGCAGCGA 


CTCCGACATC 


TTCACGCCCT 


ACGGCTGGCT 


GGAGCCGTGG 


TCCGGCCAGC 


660 


CTGCCCACCC 


ACCGCTCAAC 


CTCTCGGCCA 


AGACCGAGCT 


6GTGGCCTGG 


GCAGTGTCCA 


720 


ACTGGGGGCC 


AAACTCCGCC 


AGGGTGCGCT 


ACTACCAGAG 


CCTGCAGGCC 


CATCTCAAGG 


780 


TGGACGTGTA 


CGGACGCTCC 


CACAAGCCCC 


TGCCCCAGGG 


AACCATGATG 


GAGACGCTGT 


840 


CCCGGTACAA 


GTTCTATCTG 


GCCTTCGAGA 


ACTCCTTGCA 


CCCCGACTAC 


ATCACCGAGA 


900 


AGCTGTGGAG 


GAACGCCCTG 


GAGGCCTGGG 


CCGTGCCCGT 


GGTGCTGGGC 


CCCAGCAGAA 


960 


GCAACTACGA 


GAGGTTCCTG 


CCACCCGACG 


CCTTCATCCA 


CGTGGACGAC 


TTCCAGAGCC 


1020 


CCAAGGACCT 


GGCCCGGTAC 


CTGCAGGAGC 


TGGACAAGGA 


CCACGCCCGC 


TACCTGAGCT 


1080 


ACTTTCGCTG 


GCGGGAGACG 


CTGCGGCCTC 


GCTCCTTCAG 


CTGGGCACTC 


GCTTTCTGCA 


1140 


AGGCCTGCTG 


GAAACTGCAG 


GAGGAATCCA 


GGTACCAGAC 


ACGCGGCATA 


GCGGCTTGGT 


1200 
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TCACCTGAGA GGCTGGTGTG GGGCCTGGGC TGCCAGGAAC CTCATTTTCC TGGGGCCTCA 1260 

CCTGAGTGGG GGCCTCATCT ACCTAAGGAC TCGTTTGCCT GAAGCTTCAC CTGCCTGAGG 1320 

ACTCACCTGC CTGGGACGGT CACCTGTTGC AGCTTCACCT GCCTGGGGAT TCACCTACCT 1380 

GGGTCCTCAC TTTCCTGGGG CCTCACCTGC TGGAGTCTTC GGTGGCCAGG TATGTCCCTT 1440 

ACCTGGGATT TCACATGCTG GCTTCCAGGA GCGTCCCCTG CGGAAGCCTG GCCTGCTGGG 1500 

GATGTCTCCT GGGGACTTTG CCTACTGGGG ACCTCGGCTG TTGGGGACTT TACCTGCTGG 1560 

GACCTGCTCC CAGAGACCTT CCACACTGAA TCTCACCTGC TAGGAGCCTC ACCTGCTGGG 1620 

GACCTCACCC TGGAGGCACT GGGCCCTGGG AACT 1654 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asp Pro Leu Gly Pro Ala Lys Pro Gin Trp Ser Trp Arg Cys Cys 
1 5 10 15 

Leu Thr Thr Leu Leu Phe Gin Leu Leu Met Ala Val Cys Phe Phe Ser 

20 25 30 

Tyr Leu Arg Val Ser Gin Asp Asp Pro Thr Val Tyr Pro Asn Gly Ser 
35 40 45 

Arg Phe Pro Asp Ser Thr Gly Thr Pro Ala His Ser He Pro Leu He 
50 55 60 

Leu Leu Trp Thr Trp Pro Phe Asn Lys Pro He Ala Leu Pro Arg Cys 
65 70 75 80 

Ser Glu Met Val Pro Gly Thr Ala Asp Cys Asn He Thr Ala Asp Arg 

85 90 95 

Lys Val Tyr Pro Gin Ala Asp Ala Val He Val His His Arg Glu Val 

100 105 110 

Met Tyr Asn Pro Ser Ala Gin Leu Pro Arg Ser Pro Arg Arg Gin Gly 
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115 



120 



125 



Gin Arg Trp lie Trp Phe Ser Met Glu 
130 135 



Pro Ser His Cys Trp Gin 
140 



Leu Lys Ala Met 
145 



Gly Tyr Phe Asn Leu Thr Met 
150 



Tyr Arg Ser 
160 



Asp Ser Asp He Phe Thr Pro Tyr Gly Trp Leu Glu Pro Trp Ser Gly 

165 170 175 



Gin Pro Ala His Pro Pro Leu Asn Leu 

180 135 



Ala Lys Thr Glu Leu Val 

190 



Ala Trp Ala Val 
195 



Asn Trp Gly Pro Asn Ser Ala Arg Val Arg Tyr 
200 205 



Tyr Gin 
210 

His Lys 

225 



Leu Gin Ala His Leu Lys Val Asp Val Tyr Gly 

215 220 



Leu Pro Gin Gly Thr Met Met Glu Thr Leu 
230 235 



Lys Phe Tyr Leu Ala Phe Glu Asn Ser Leu His Pro 

245 250 



Arg Tyr 

240 



Tyr He Thr 

255 



Glu Lys Leu Trp Arg Asn Ala Leu Glu Ala Trp Ala Val Pro Val Val 

260 265 270 



Leu Gly Pro 
275 



Ser Arg Ser Asn Tyr Glu Arg Phe Leu Pro Pro 

280 285 



Phe He His Val Asp Asp Phe Gin Ser Pro Lys Asp Leu Ala Arg Tyr 
290 295 300 

Leu Gin Glu Leu Asp Lys Asp His Ala Arg Tyr Leu Ser Tyr Phe tog 
305 310 315 320 



Trp Arg Glu Thr Leu 

325 



Arg Pro Arg Ser 



Phe Ser Trp Ala Leu Ala Phe 
330 335 



Lys Ala Cys Trp Lys Leu Gin Glu Glu 
340 345 



Arg Tyr Gin Thr Arg 

350 



Gly He Ala Ala Trp Phe Thr 
355 
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WHAT IS CLAIMED AS NEW AND DESIRED T O BE SECURED BY 
LETTERS PATENT OF THE UNITED STATES IS: 

1. A polypeptide having an amino acid sequence selected from 
the group consisting of: 

(a) a first subsequence corresponding to amino acid 
positions 1 to 301 of Fuc-TIII connected to the 
amino terminus of a second subsequence corresponding 
to amino acid positions 301 to 359 of Fuc-TVI; 

(b) a first subsequence corresponding to amino acid 
positions 1 to 160 of Fuc-TIII connected to the 
amino terminus of a second subsequence corresponding 
to amino acid positions 160 to 359 of Fuc-TVI; 

(c) a first subsequence corresponding to amino acid 
positions 1 to 300 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
amino acid positions 302 to 361 of Fuc-TIII; 

(d) a first subsequence corresponding to amino acid 
positions 1 to 159 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
amino acid positions 161 to 361 of Fuc-TIII; 

(e) a first subsequence corresponding to amino acid 
positions 1 to 33 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 34 to 69 of Fuc-TVI connected to the amino 
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terminus of a third subsequence corresponding to 
amino acid positions 71 to 361 of Fuc-TIII; 

(f) a first subsequence corresponding to amino acid 
positions 1 to 70 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 70 to 85 of Fuc-TVI connected to the amino 
terminus of a third subsequence corresponding to 
amino acid positions 87 to 361 of Fuc-TIII; 

(g) a first subsequence corresponding to amino acid 
positions 1 to 86 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 86 to 98 of Fuc-TVI connected to the amino 
terminus of a third subsequence corresponding to 
amino acid positions 100 to 361 of Fuc-TIII; 

(h) a first subsequence corresponding to amino acid 
positions 1 to 99 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 99 to 129 of Fuc-TVI connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 131 to 361 of Fuc-TIII; 

(i) a first subsequence corresponding to amino acid 
positions 1 to 130 of Fuc-TIII connected to the 
amino terminus of a second subsequence corresponding 
to positions 130 to 159 of Fuc-TVI connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 161 to 361 of Fuc-TIII; 
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(j) a first subsequence corresponding to amino acid 

positions 1 to 33 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
positions 34 to 70 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 70 to 359 of Fuc-TVI; 

(k) a first subsequence corresponding to amino acid 

positions 1 to 69 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
positions 71 to 86 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 86 to 359 of Fuc-TVI; 

(1) a first subsequence corresponding to amino acid 

positions 1 to 85 of Fuc-TVT connected to the amino 
terminus of a second subsequence corresponding to 
positions 87 to 99 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 99 to 359 of Fuc-TVI; 

(m) a first subsequence corresponding to amino acid 

positions 1 to 98 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
positions 100 to 130 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 130 to 359 of Fuc-TVI; 

(n) a first subsequence corresponding to amino acid 

positions 1 to 129 of Fuc-TVI connected to the amino 
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terminus of a second subsequence corresponding to 
positions 131 to 160 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 160 to 359 of Fuc-TVI; 

(o) a first subsequence corresponding to amino acid 

positions 1 to 33 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 34 to 98 of Fuc-TVI connected to the amino 
terminus of a third subsequence corresponding to 
amino acid positions 100 to 361 of Fuc-TIII; 

(p) a first subsequence corresponding to amino acid 

positions 1 to 86 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 86 to 159 of Fuc-TVI connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 161 to 361 of Fuc-TIII; 

(q) a first subsequence corresponding to amino acid 

positions 1 to 33 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
positions 34 to 99 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 99 to 359 of Fuc-TVI; 

(r) a first subsequence corresponding to amino acid 

positions 1 to 85 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
positions 87 to 160 of Fuc-TIII connected to the 
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amino terminus of a third subsequence corresponding 
to amino acid positions 160 to 359 of Fuc-TVI; 
(s) a first subsequence corresponding to amino acid 

positions 1 to 99 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 

is 99 to 159 of Fuc-TVI connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 161 to 361 of Fuc-TIII; 
(t) a first subsequence corresponding to amino acid 

positions 1 to 98 of Fuc-TVI connected to the amino 
terminus of a second subsequence corresponding to 
positions 100 to 160 of Fuc-TIII connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 160 to 359 of Fuc-TVI; 
(u) a first subsequence corresponding to amino acid 

positions 1 to 99 of Fuc-TIII connected to the amino 
terminus of a second subsequence corresponding to 
positions 116 to 166 of Fuc-TV connected to the 
amino terminus of a third subsequence corresponding 
to amino acid positions 161 to 361 of Fuc-TIII; 

2. A DNA sequence, comprising a DNA subsequence wherein 
said DNA subsequence encodes a polypeptide according to Claim 
1. 

3. A plasmid, comprising a DNA sequence according to 
Claim 2. 
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4. A transformed cell, wherein said cell contains a 
plasmid according to Claim 3. 

5. A method of producing a polypeptide, comprising 
culturing a cell according to Claim 4* 



WO 97/09421 



1/27 



A 



3 l~ 



«e» O 



In 

33 



*> — — 3> n n 



> 



x> 
x? 
39 
39 
-> 

o 



St 

n 

Ifl 

33 
A 

a 

3> 
3? 
3* 
> 

59 

y> 

39 

s> 

3» 
3* 
3> 
3> 
39 
> 
3* 
3> 

39 
3? 
3? 
33 



3» 
3* 



> 



a 
n 



a 
to 



n ft 



n 

m 
a 
n 
n 



> 
3» 

a 
a 



a 



ft 

3# 
ft 



4 + 

a 



ft 

3» 



3» 
3» 
3» 



3» 

a 
a 
39 
a 

«■» 
n 
-* 
a 
a 

39 
3» 



3» 
3» 
> 



* * 



3* 

a 
a 

#^ 

3> 



a 

n 

o 
> 

3* 



n 

3> 
3* 

* ■ 

3» 
3* 
> 

3» 

a 

9 



CI 
3> 

a 

a 

o 
a 
n 



m 

ft ■* ** 

S» — 

r. n 



e 

w 

* * 



ft 
n 
n 

> 

a 

5 

«=» 
«i 

> 
n 

3? 
O 

o 



n 
> 



r. 

n 
s» 
a 
a 
a 



29 

a 



•4 

O 

> 
a 



3> 
3* 

a 



n 

R 



> ■ 
> 
p 

ft 



ft 

S 



n 



3> 



n 
n 



n 
n 

• t 



* i 
n 



n 
o 

n 

n 

n 

n 

n 
n 

> 
n 



n 



CI 

n 



r. 

r. 



c 

r*. 

* 

n 

& 
n 
p-» 
o 

E 

n 
n 

n 

n 

M 

n 
r* 
r» 

o 
&> 
n 



n 
o 



R 

n 

R 
3 



n 



* * 



r. 

r. »-i m 
> M — 

— M ^ 

» r. 
n r. 

— E 
" n h 

— n n 

r. « »-i 
n 
r» 

n 
-> 

n 

i 

n 

n 
r. 
n 



54 

n 
n 



n 
r. 

c 

n 

n 



1 

n 

s 

r*. 



r* 
r 

n 

c. 
n 
r> 

c 

n 
n 
n 

K 

n 
n 

Q 

v * 

n 

37 



n I- n 
-4 n FS 



r» 



r. 



2 a 



3? R 

n 

3> 

r» 

n 
n 

n 

n *b 

n 

o 

n en 

r» 

"9 

n (a 



r> n 

a? cr 

n 

n 

n n 
n 

s G 

39 

3? O 

39 

n 

* * 

3> (O 

3» r: 



3> ft 

n 

go 

3» 

n 
o 

39 



3* 



2 "* 

m • 

> W 

v ■ 



r» 
n 

n «a 

3» 

n 

o -a 

n 
a 

f?° 

n 

n a? 
r» 

-» «=» 
n 

39 

O " 

r\ 

y s: 



rt s -4 



9 * 

9 o 



3> C 



«=1 

39 O 
> 

n ia 

o 

-> 

n 
rt 

n 



o 
n 

n 39 

n 
n 

a s 

n 



? ° 

3» f 



n 

3» C 

3» 

O 

39 C 

n 
o 

O 39 

n 

r» 

n s 

n 

*^ 

■ • 

n 

3» 

^ w 



3» «s 



» 9 
* • 



o 

3? Fi 

n 
n 

•4 

n to 

n 

■* 

-» e 

n 

3> S 



3> c 



3i 



a 

n 
? 

3> n 

a 

rt a 
n 

CI o 

n 

3» 

3> S 

n 
n 

C 3> 



9 33 



3> O 

n (a 

3» 

* # 

39 

CI D 

n 
o 

•* <c 

n 

n 

n s 
ri 

39 «=I 

n 

39 C3 

n 
n 

> co 

n ia 

n 

o 



w — — 



2« O 
39 s 



r» 

39 

3> IX 

a 
n 

-» <c 

n 

n 

39 O 

n 
o 

o 

n 
n 

a o 

39 

n 

n a 
n 

a u 
i 

s» s 



> e 



n wi 



n 

39 



O 

n o« 

o 

o 

n «g 

n 

«> 

3> d 

n 
A 

O S) 

o 

-4 

a a 

ct 

n 

a 
a 

3» R 

n 



H 
■« 

H CA 

ci 

a o 

n 

n 

39 co 
a 

n «o 
-» 

a 

a* 

n 

3> 13 

n 

n «^ 

39 

n 

n «s 

a 

o 

3^ 

3> 

3> & 
1 



39 — r- 



2 
a 



* * 



* * 

* * 

a 

5" 

n 
a 

* * 

a a 
n 

■ 

3»- 

a fA 

3» 



39 S 



39 

39 SX 

a 

* * 

ci 

39 
39 



a* 

39 W 

a 
> 

* * 

ft 



a 

ft 39 



* * 

a 



> s 



n vi 

ft 

ft S5 

* * 

3* 

a s 
a 
n 
ft 

a 

n 

a 

a a 

a 

n 

39 o 

a 
a 



a a 

a 

> 



3» 
ft 



a 
a 



ft «3 
9 

i 

ft «5 

ft 

ft 

ft «3 

-) 

J>- 

39 S 

ft 

M 

a n 

n 

ft 

39 O 

s 

39 S 



J *^ ~ E ~ ^ 3^ ^ 



• ■ 
3» 

ft «4 

ft 

a 

3> F) 

a 
ft 

a 
a 

ft 59 



a a — 



r. n 



ft ;*» 

3* 

■ * 

39 PI 

• • 

39 

* * 

— « 

a 
ft 

ft «5 

ft 

a 

a a 
o 

p • 
ft 

39 

a 

n s? 

r» 

a 

a? © 
r» 

a n 

ft 
n 

39 a 
ft 

3> 



ft 



ft 

a 

39 © 

a 
ft 

a 33 
ft 

39 

39 CT 

a 
a 

-> <s 
a 

-a 

39 «=J 

ft 

ft 

ft «o 

39 

ft 

3? O 

a 
a 

ft 19 
39 

a 

39 C 
ft 

39 



ft > - Z. 

a 

ft w - - 



ft 

ft «3 

a d 

^% 
»« 

a 

ft 39 
•4 

n 

o <a 
ft 

39 - 

a ca 

•« 

a 

2° 

ft (A 
ft 

ft CA 

ft 

ft 

a a 

3* 



-* > o 



3> 
ft 

39 

* 

ft 
ft 



ft ■© 

ft 



a o 

n 

n 

ft «3 
n 

ft -4 

ft 



ft 
ft 

a 

39 







> 








> 












> 








> 








■ * 








* • 


> 


a 


< 




X 


* 




■ w 




• * 






* 


39 

• 






a 






3» 
















3D 




39 


















3"' 












m 1 

39 




a 
a 








-4 




a 














e 






■ 
















39 




- * 












> 

• t 








ft 






39 




39 




3* 




39 




-4 












■ * 




ft 








•4 




ft 








ft 


CA 


* • 




ft 




ft 








> 




39 


* 


a 




ft 


•4 


> 






«C 


ft 




ft 




n 




> 




ft 




« * 






S 


ft 






3> 


















3» 








• • 






sr 










• • 




1 




■ • 








> 






CA 




'.1 






39 






























• * 




* • 

ft 









a r. 

ri- 
ft 39 
3> •* 

b * 

3 » 

39 / 

39 a 

a 

a 

ft «o 

39 r 
ft 

39 <0 

3J 

8 s 

ft 

ft «0 
3> 

s « 

a * 



r. s 

* • 

- <t 



> 

> Z 



2/27 
Figure 2 



PCT/US96/1I3816 



WO 07/09421 



# 



3/27 



figure 2 
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Figure 3 (con^d) 
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Sequence III (cone.) 
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Figure 2 (conr*d) 
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Figure 4 
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Figure 4 (cont'd) 
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Figure 4 (cont'd! 
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Figure 4 (cont'd) 
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Figure 4 (cont'd) 
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